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- I f)'(j 2DG? 

TITLE: FAMILY 6H 61 XYLANASES " * ^'^ 

PVS 

FIELD OF THE INVENTION 

The present invention relates to xylanase enzymes belonging to the GH 61 family or 
homology group of polypeptides. The invention further relate to polynucleotides encoding 
5 said xylanases. to nucleic acid constructs comprising such polynucleotides and to expression 
vectors and recombinant host cells comprising such constructs. The invention also relate to 
processes for preparing said xylanases, to compositions comprising said xylanases and to 
the use of said xylanases in various applications. 

BACKGROUND OF THE INVENTION 

10 Carbohydrates and glycoconjugates are substrates for glycosyl transferases (GTs) 

and glycoside hydrolases (GHs). The structure of glycoside hydrolases began to be solved 
starting from the 1980s. At the same time, new GH proteins were discovered and their amino 
acid sequence determined. Two main obsen^atlons emerged from the new data. 1) The clas- 
sical E.G. nomenclature system for naming enzyme families was not precise enough to das- 

15 sify the increasing number of enzymes that had different structure yet perfonned the same 
enzymatic reaction. 2) enzymes related by homology could have different enzymatic activity 
thus also making the B.C. nomenclature system confusing for these related enzymes. A new 
family based nomenclature system was proposed by Bernard Henrissat in 1991 based on the 
structure of the enzymes (Henrissat B., A classification of glycosyJ hydrolases based on 

20 amino-add sequence similarities. Biochem. J. 280:309-316(1991); Henrissat B.. Bairoch A. 
New families in the classification of glycosyl hydrolases based on amino- acid sequence 
similarities. Biochem. J. 293:781-788(1993); Henrissat B., Bairoch A. Updating the se- 
quence-based dassification of glycosyl hydrolases. Biochem. J. 316:695-696(1996) and Da- 
vies G., Henrissat B. Structures and mechanisms of glycosyl hydrolases. Structure 3:853- 

25 859(1995).). The classification of glycoside hydrolases in families based on amino acid se- 
quence similarities was introduced because there is a direct relationship between sequence 
and folding similarities, and such a classification is expected to: 

(i) reflect the structural features of the enzymes, which cannot be reflected by the substrate 
^ 30 specifidty alone, 

(ii) help to reveal the evolutionary relationships between the enzymes, and 

(iii) provide a convenient tool to derive mechanistic infonmatlon. 

Amino acid sequences grouped by nature of their similarity to a particular GH family 
35 can give ideas as to the activity of the new hypothetical protein. Some of these amino add 
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sequences, grouped in a GH family by homology have later been suggested to have certain 
enzymatic activity. So. in short, grouping a new amino acid sequence in a GH family does 
not specifically indicate the exact enzymatic activity. The enzymatic activity must be demon- 
strated by an activity assay of the cloned or purified protein. If the assay is difficult determine- 
5 tion of the proteins actual function can remain un-revealed for years. 

Publicly available information on the GH-61 family counts presently only 6 nucleo- 
tide sequences of unknown function. One document discloses, however, a guess that one of 
these sequences (SwissProt sequence 014405) encode an endogfucanase enzyme (Salo- 
heimo M., Nakari-Setaelae T., Tenkanen M.. Penttilae M,; (1997) "cDNA cloning of a Tricho- 

10 derma reesei cellulase and demonstration of endoglucanase activity by expression in yeast."; 
Eur. J. Biochem. 249:584-591(1997). Work by the same group confinrned that the enzyme 
GH61A, when purified showed very weak cellulase activity. The group itself admitted that 
since the activity was three orders of magnitude lower than normal cellulases, that perhaps 
the cellulose was not the conrect native substrate for the enzyme. The group also made an 

15 exhaustive study of the purified enzyme with all other known carbohydrate assays (man- 
nanase. galactanase etc.) and found that the enzyme had no activity for these substrates. 
The authors conclude In their discussion that: " It would is therefore unlikely that the fungus 
would produce Cel61A for its endoglucanase activity when it is already producing more effi- 
cient endoglucanses....]t is possible that both TrCel61A and AbCel61A are active against 

20 specific parts of more complex natural cellulosic substrate. However, further studies are 
needed to reveal the function of the glycoside hydrolase 61 enzymes." (page 6505). 

Presently, the web-site of CAZY (http://afmb.cnrs-mrs.fr/CAZY/ ) lists the GH-61 fa- 
miliy as unclassified, meaning that properties like mechanism, catalytic nucleophile/base, 
catalytic proton donors, and 3-D structure are not known for enzymes belonging to this fam- 

25 ily. Here the only listed known activity is endoglucanase activity. 

Despite extensive screening of Trichoderma reesei recombinant yeast libraries for 
cellulases and recovering many other cellulases from other GH families we have not unam- 
biguously identified any endoglucanase belonging to the GH-61 family from Trichoderma 
reesei thus also indicating that GH61. if that family does include cellulases. have only very 

30 weak activity and thus cannot be detected by the nomially very sensistive recombinant yeast 
activity screening. Hence, the present 6 publicly disclosed nucleotide sequences belonging 
the GH-61 family are either unknown open reading frames sharing homology to SwissProt 
sequence 014405 or sequences cloned based on purification and sequencing of a cellulose 
induced gene (Isolation and characterization of a ceilulose-growth-specific gene from Agari- 

35 cus blsporus.'';Gene 119:183-190(1992), and besides the Cel61A protein mentioned above, 
there is in the art no knowledge of the function and properties of any protein or peptide be- 
longing to the GH-61 familily nor have any enzyme and/or its function been reliably demon- 
strated. 
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SUMMARY OF THE INVENTION 

Due to our efforts searching for new carbohydrases we have now for the first time discovered 
xylanase enzymes belonging to the GH-61 family. 

Hence, in a first aspect the invention provide an teolated GH-61 polypeptide having 
5 xylanase activity. 

In further aspects the invention provides a polynucleotide encoding the xylanase of 
the invention; a nucleic acid construct comprising the polynucleotide encoding the xylanase, 
operably linked to one or more control sequences that direct the production of the xylanase a 
suitable host; a recombinant expression vector comprising the nucleic acid construct of the 
10 invention and to a recombinant host cell comprising the nucleic add construct of the 
invention. 

In a still further aspect the Invention provides a composition comprising the xylanase 
polypeptide. 

In a still further aspect the invention provides a transgenic plant comprising a 
15 nucleotide sequence of the invention and capable of expressing the xylanase of the 
invention. 

In still further aspects the invention provides methods for producing a xylanase 
Including a method comprising: 

(a) cultivating a strain, which in its wild-type form is capable of producing the xylanase, to 
20 produce the xylanase; and 

(b) recovering the xylanase 
and a method comprising: 

(a) cultivating a recombinant host cell of the Invention under conditions conducive for 
production of the xylanase; and 
25 (b) recovering the xylanase. 

In a still further aspects the invention relates to a method for hydrolysing xylan 
comprising contacting a xylan substrate with a xylanase of the invention. 

BRIEF DESCRIPTION OF DRAWINGS 

30 No drawings 

DETAILED DESCRIPTION OF THE INVENTION 



Definitions 

The term ''xylanase" as used herein is to be understood as a polypeptide capable of 
hydrolysing xylan substrate. The xylan may be purified from several different sources for 
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which the degree of acetylation and characteristics of side chains will vary depending on the 
source material and the purification method. 

The term "xylanase activity" as used herein is to be understood as the ability of a xy- 
lanase to convert a xylan substrate into hydrolysis products. The activity is defined as the 
5 amount of xylan (normally given in Mmol) which can be converted into product per time unit 
(normally seconds) per amount of xylanase (normally grams of xylanase) measured at speci- 
fied conditions (nomnally optimal pH. temperature and solvent conditions for the xylanase). 
The concept of enzyme activity is known to the skilled person. 

The temi "GH" and "GH family or homology groups" as used herein is to be under- 
10 stood as a Glycoside Hydrolases, classified in accordance with the well established CAZY 
classification system. 

The term "GH-61" as used herein, is to be understood as a family of enzymes, which share 
common conserved sequence portions and foldings to be classified in family 61 of the well 
established CAZY GH classification system. In a preferred embodiment mature GH-61 xy- 
15 lanases of the invention share the following conserved portions: 

H at position 1, 

A or P at position 59, 

G at position 60, 
20 G at position 75, 

P or A at position 76, 

W or F at position 100, 

F or T at position 101, 

K or C at position 102, 
25 I or V or L at position 1 03, 

L or I or V or M at position 130, 

Pat position 131, 

G, Xaa, Y at position 137-139 

L or V or I or M at position 140 
30 L or V or I or M at position 141 

Rat position 142 

E or Q at positions 143-144, 

L or V or I at position 148 

H or N at position 149 
35 C at position 163 and 

P, G and P at position 209-21 1 . 
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In the present context "Xaa" means any amino add. Also in the present context the 
numi3ering system for the residues is based on SEQ ID NO:2 wherein the N terminal has 
been removed to the first histidine of the polypeptide (there after denoted Histidine at position 
1 ). This numbering system of residues may be applied to other GH61 proteins through a 
5 multiple sequence alignment with SEQ ID NO: 2 starting fn3m that first Histidine, disregarding 
gaps generated by the multiple sequence alignment. The relative location of Histidine-1 may 
thus be easily determined by multiple sequence alignment with SEQ ID No:2. Alignment may 
be perfomned by using AlignX in the computer program Vector NTI ver. 7.1 (Infonnax inc., 
7600 Wisconsin Avenue, Suite #1100, Bethesda, MD 20814, USA). The amino add 

10 alignment is created using the Clustal W algorithm (Nudeic Acid Research, 22 (22): 4673- 
4680. 1994) and the following additional parameters: Gap opening penalty of 10, Gap 
extension penalty of 0.05, Gap separation penalty range of 8. Painvise alignment parameters 
were Ktuple = 1. gap penalty = 3, gap length opening penalty = 10, gap extension penalty = 
0.1 , window size = 5 and diagonals = 5. 

15 The temri "core sequence" as used herein is to be understood as the sequence of the 

mature xylanase catalytic domain excluding other functional domains such as binding 
domains et. 

The temn "conserved portion" as used herein Is to be understood as an amino acid or 
amino acid subsequence contained in all GH-61 xylanases of the invention and thus a com- 
20 mon feature of all said xylanases. The term conserved portions is also used for nudeotides 
and variants thereof which by virtue of the degeneracy of the code encodes conserved amino 
add portions. 

The term "Identity" as used herein, is to be understood as the homology between two 
amino acid sequences or between two nucleotide sequences.. For purposes of the present 

25 invention, the degree of identity between two amino acid sequences is determined by using 
AlrgnX in the program of Vector NTI ver. 7.1 (Infonnax Inc., 7600 Wisconsin Avenue 
Suite #1100. Bethesda, MD 20814, USA). Amino add alignment is created using the Clustal 
W algorithm (Nudeic Add Research. 22 (22): 4673-4680, 1994). The homology score matrix 
used in the first case was blosum62 and in the second case, exact identity of the amino acid 

30 sequences in the alignment was used. The following additional parameters were used: Gap 
opening penalty of 10, Gap extension penalty of 0.05, Gap separation penalty range of 8. 
PaiHA^ise alignment parameters were Ktuple = 1, gap penalty = 3, gap length opening penalty 
= 10, gap extension penalty = 0.1, window size = 5 and diagonals 5. 

The degree of identity between two nucleotide sequences may be determined using 

35 the same algorithm and software package as described above for example with the following 
settings: Gap penalty of 10, and gap length penalty of 10. Pairwise alignment parameters 
were Ktuple=3, gap penalty=3 and windows=20. 
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The term 'Yragmenf as used herein about a fragment of an xylanase of the invention, 
is to be understood as a polypeptide having one or more amino acids deleted from the amino 
and/or carboxyl termrnus of the amino acid sequence of the xylanase. while retaining the car- 
bohydrate activation of the xylanase. 
5 The term "allelic variant" as used herein about allelic variants of a polynucleotide of 

the invention is to be understood as any of two or more alternative fomns of a gene occupy- 
ing the same chromosomal locus. Allelic variation arises naturally through mutation, and 
may result in polymorphism within populations. Gene mutations can be silent (no change in 
the encoded polypeptide) or may encode polypeptides having altered amino acid sequences, 

10 An allelic variant of a polypeptide is a polypeptide encoded by an aireilc variant of a gene. 

The tenm "modlficationCs)" as used herein about modified polypeptides or modified 
polynucleotides is intended to mean any chemical modification of the xylanase as well as 
genetic manipulation of the polynucleotide encoding the xylanase. The mod[fication(s) can be 
replacement(s) of the amino add side chain(s). substitution(s), deletion(s) and/or inser- 

15 tions(s) in or at the amino acid(s) of interest. 

The term "artificial variant" as used herein is to be understood as an modified xy- 
lanase capable of activating cart>ohydrates, which has been produced by an organism which 
is expressing a modified gene as compared to the unmodified polynucleotide encoding the 
unmodified xylanase The modified polynucleotide (gene), from which said variant is pro- 

20 duced when expressed in a suitable host, is obtained through human intervention. 

The term "cDNA" as used herein is intended to mean a DMA molecule which can be 
prepared by reverse transcription from a mature, spliced. mRNA molecule derived from a eu- 
karyotic cell. cDNA lacks the Intron sequences that are usually present in the conesponding 
genomic DNA. The initial, primary RNA transcript is a precursor to mRNA and it goes through 

25 a series of processing events before appearing as mature spliced mRNA. These events in- 
clude the removal of intron sequences by a process called splicing. When cONA is derived 
from mRNA it therefore lacks intron sequences. 

The term "nucleic add construcT as used herein is to be understood as a nucleic acid 
molecule or polynucleotide, either single- or double-stranded, which is isolated from a natu- 

30 rally occurring gene or which has been modified to contain segments of nucleic acids In a 
manner that would not otherwise exist in nature. The term nucleic acid construct is synony- 
mous with the term "expression cassette" when tiie nudeic add construct contains the con- 
trol sequences required for expression of a coding sequence of the present invention. 

The term "control sequence" as used herein is to be understood as nudeotide se- 

35 quence induding all components, which are necessary or advantageous for the expression 
of a polypeptide of the present invention. Each control sequence may be native or foreign to 
tiie nucleotide sequence encoding the polypeptide. Such control sequences include, but are 
not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal 
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peptide sequence, and transcription temninator. At a minimum, the control sequences in- 
clude a promoter, and trai:iscrlptionat and translatlonal stop signals. The control sequences 
may be provided with linkers for the purpose of introducing specific restriction sites facilitating 
ligation of the control sequences with the coding region of the nucleotide sequence encoding 
5 a polypeptide. 

The term "operably linked" as used herein is to be understood as a configuration in 
which a control sequence is appropriately placed at a position relative to a nucleotide se- 
quence encoding an )cylanase such that the control sequence directs the expression of the 
xylanase. 

10 The term "coding sequence" as used herein is to be understood as a nucleotide se- 

quence, which directly specifies the amino acid sequence of the xylanase. The boundaries 
of the coding sequence are generally determined by an open reading frame, which usually 
begins with the ATG start codon. The coding sequence typically includes DMA, cDNA, and 
recombinant nucleotide sequences. 

15 The term "expression** as used herein is to be understood as including any step in- 

volved in the production of the xylanase Including, but not limited to, transcription, pcst- 
transcriptional modification, translation, post-translattonal modification, and secretion. 

The term "expression vector" as used herein is to be understood as a polynucleotide 
molecule, linear or circular, that comprises a segment encoding an xylanase of the invention, 

20 and which is operably linked to additional segments that provide for its transcription. 

The term *'host cell" as used herein, includes any cell type which is susceptible to 
transformation with a nucleic acid construct. 

The terms "polynucleotide probe", "hybridization" as well as the various stringency 
conditions are defined infra. 

25 GH-61 xylanases 

The GH-61 xylanases of the invention all share the characteristics described supra. Prefera- 
bly, the mature GH-61 xylanase comprise in its naturally occurring fonm the conserved por- 
tions of 

30 a Hisitidine (H) at position 1 , 

A or P at position 59, 

G at position 60, 

G at position 75, 

P or A at position 76, 
35 W or F at position 1 00, 

F or T at position 101, 

K or C at position 102, 
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I or V or L at position 103, 

L or I or V or M at position 130, 

Pat position 131, 

G, Xaa and Y at position 137-139 

L or V or I or M at position 140, 

L or V or I or M at position 141, 

R at position 142. 

E or Q at positions 143-144. 

L or V or I at position 148. 

H or Nat position 149. 

C at position 163 and 

P and G and P at position 209-21 1 . 

when adopting the above mentioned numbering system. 

In particular the xylanase is an isolated xylanase meaning that a preparation of the 
xylanase contains at the most 90% by weight of other polypeptide material with which it may 
be natively assodated (lower percentages of other polypeptide material are prefenred. e.g. at 
the most 80% by weight, at the most 60% by weight, at the most 50% by weight, at the most 
40% at the most 30% by weight, at the most 20% by weight, at the most 10% by weight, at 
the most 9% by weight ,at the most 8% by weight, at the most 6% by weight, at the most 5% 
by weight, at the most 4% at the most 3% by weight, at the most 2% by weight, at the most 
1% by weight and at the most 72% by weight). Thus, it is prefen-ed that the isolated xylanase 
is at least 92% pure. i.e. that the xylanase constitutes at least 92% by weight of the total 
polypeptide material present in the preparation, and higher percentages are preferred such 
as at least 94% pure, at least 95% pure, at least 96% pure, at least 96% pure, at least 97% 
pure, at least 98% pure, at least 99%, and at the most 99.5% pure. In particular, it is pre- 
ferred that the xyfanases disclosed herein are in "essentially pure form", i.e. that the xylanase 
preparation is essentially free of other polypeptide material with which it is natively assod- 
ated. This can be accomplished, for example, by preparing the xylanase by means of well- 
known recombinant methods. 

The xylanase of the invention may be synthetically made, naturally occurring or a 
combination thereof. In a particular embodiment the xylanase of the invention may be ob- 
tained from a microorganism such as a prokaryotic cell, an archaeal ceil or a eucaryotic cell. 
The cell may further have been modified by genetic engineering (cf. Sources of GH-61 xy- 
tanases, vide infra). 

In another particular embodiment the xylanase of the invention have size from about 
5 kDa to about 500 kDa. in particular from about 10 kDa to about 250 kDa, more particularly 
from about 20 kDa to 100 kDa. 
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In a further embodiment, the xylanase of the invention may exhibit optimum xylan hy- 
drolysis at pH 5-9. 

In a still further embodiment, the xylanase of the invention may exhibit optimum xylan 
hydrolysis at a temperature within the range from about lO^C to about 80 ^'C, particularly in 
5 the range from about 20*^0 to about 60*C. 

In a stilJ further embodiment the xylanase of the invention may be functionallly stabile 
over at a temperature of up to 100 X in particular up to 80 ^^C, more particularly up to 60'*C. 

In a still further emtK)diment the xylanase is specific for xylan and in a particular em- 
bodiment the xylanase comprises a binding domain, such as a xylan binding domain (XBD). 

10 In a particular embodiment the xylanase exhibit at least 20%, in particular at least 

40%, such as at least 50%. in particular at least 60%, such as at least 70%, more particularly 
at least 80%, such as at least 90%, most particularty at least 95%, such as about or at least 
100% of the xylanase activity of either of the xylanases consisting of the amino acid se- 
quences shown as the mature xylanase of SEQ ID NO:2. SEQ ID NO:4 or SEQ ID NO:6 

15 The xylanase preferably comprises or consists of. an amino add sequence which has 

a degree of identity to the amino adds of the mature xylanases of SEQ ID NO:2, SEQ ID 
NO:4 or SEQ ID NO:6 of at least 65%, particularly at least 70%. e.g. at least 75%. more par- 
ticularty at least 80%, such as at least 85%, even more particularty at least 90%, nru>st par- 
ticularty at least 95%, e.g. at least 96%, such as at least 97%, and even most particularty at 

20 least 98%, such as at least 99% (hereinafter "homologous xylanases**). with the proviso that 
the xylanase belong to the GH-61 family and that the xylanase preferably comprise the 
above mentioned conserved amino add portions. 

In an interesting embodiment, the amino add sequence differs by at the most ten 
amino adds (e.g. by ten amino acids), in particular by at the most five amino acids (e.g. by 

25 five amino acids), such as by at the most four amino acids (e.g. by four amino adds), e.g. by 
at the most three amino adds (e.g. by three amino acids) from amino adds of the mature xy- 
lanases of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6, observing the above mentioned 
proviso. In a particular interesting embodiment, the amino add sequence of the xylanase dif- 
fers by at the most two amino acids (e.g. by two amino adds), such as by one amino acid 

30 from amino adds of the mature xylanases OF SEQ ID NO:2. SEQ ID NO:4 or SEQ ID NO:6. 

Aligning the xylanases consisting of the amino add sequence shown as amino acids 
of the mature xylanases of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6 with the dosest 
prior art and using the aligment method described above (see the section entitled "Defini- 
tions"), the dosest known sequence is Cel61A a Trichoderma reesei (Hypocrea jecorina) 

35 endo-1,4-glucanase IV. 

Particularty, the xylanase comprise the amino add sequence of the mature xylanases 
of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6; an allelic variant thereof; or a fragment 
thereof that has the xylanase activity. 
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In a further particular embodiment, the xylanase consists of amino acids of the ma- 
ture xylanases of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6 

The xylanase of the invention may be a wild-type xylanase identified and isolated 
from a natural source. 

5 Furthemiore, the xylanase of the Invention may be prepared by the DNA shuffling 

technique, such as described in J.E. Ness et al. Nature Biotechnology 17, 893-896 (1999). 

The present invention also encompass artificial variants of the xylanases of the inven- 
tion, where said xylanases has been mutated by adding, substituting and/or deleting one or 
more amino adds from said xylanase polypeptides while retaining the xylanase activity and 

10 the conserved portions. Hence, the xylanase of the invention may be an artificial variant of a 
GH-61 xylanase, wherein at least one substitution, deletion and/or insertion of an amino add 
has been made to the amino acid sequence in a parent GH-61 xylanase amino add se- 
quence. In particular the artificial variant comprises, in particular consists of, an amino acid 
sequence that has at least one substitution, deletion and/or insertion of an amino acid as 

15 compared to amino adds the mature xylanases of SEQ ID NO:2, SEQ ID NO:4 of SEQ ID 
NO:6. In particular the mutation is made outside the conserved portion of the sequence or a 
substitution may be made within a conserved portion while maintaining the xylanase in the 
GH-61 family. For example for position 76 P may be substituted by A. while sill being con- 
served according to the definition. 

20 Such artificial variants may be constructed by standard techniques known in the art 

usually followed by screening and/or characterization. Standard techniques includes classical 
mutagenesis, e.g. by UV irradiation of the cells or treatment of cells with chemical mutagens 
as described by Gerhardt et al. (1994); in vivo gene shuffling as described in WO 97/07205; 
in vitro shuffling as described by Stemmer. (1994) or WO 95/17413. random mutagenesis as 

25 described by Eisenstadt E. et al., (1994); PGR techniques as described by Poulsen et al. 
(1991); family shuffling as described by J.E. Ness, et al. Nature Biotechnology, vol. 17, pp. 
893-896 (1999); site-directed mutagenesis as described by Sambrook et al. (1989), Sam- 
brook et al.. Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, NY. A general de- 
scription of nucleotide substitution can be found in e.g. Ford et al., 1991, Protein Expression 

30 and Purification 2, p. 95-107. 

Such standard genetic engineering methods may also be used prepare a diversified 
library of variant nudeotide sequences from the genes encoding one or more parent xy- 
lanases, expressing the xylanase variants in a suitable host cell and selecting a preferred 
variant(s). A diversified library can be established by a range of techniques known to the art 

35 (Reetz MT; Jaeger KE, in Biocatalysis - from Discovery to Application edited by Fessner WD, 
Vol. 200. pp. 31-57 (1999); Stemmer. Nature, vol. 370, p.389-391. 1994; Zhao and Arnold. 
Proc. Natl. Acad. Sd., USA. vol. 94, pp. 7997-8000. 1997; or Yano et al.. Proc. Natl. Acad. 
Sd., USA. vol. 95. pp 551 1-5515, 1998). 
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In one embodiment of the invention, amino add changes (In the artificial variant as 
well as in wild-type polypeptides) are of a minor nature, that is conservative amino acid sub- 
stitutions that do not significantly affect the folding and/or activity of the protein: small dele- 
tions, typically of one to about 30 amino adds; small amino- or carboxyl-terminal extensions, 
5 such as an amino-tenminal methionine residue; a small linl^er peptide of up to about 20-25 
residues; or a small extension that facilitates purification by changing net charge or another 
function, such as a poly-histidine tract, an antigenic epitope or a binding domain. 

Examples of conservative substitutions are within the group of basic amino adds 
(arginine, lysine and histidine), acidic amino acids (glutamic add and aspartic add), polar 
10 amino adds (glutamine and asparagine), hydrophobic amino adds (leudne, isoleucine, 
valine and methionine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), and 
small amino acids (glydne, alanine, serine and threonine). Amino add substitutions which 
do not generally alter and or impair the function of a protein are l<nown in the art and are 
described, for example, by H. Neurath and R.L. Hill. 1979, In. The Proteins, Academic Press, 
15 New York, The most commonly occurring exchanges are Ala/Ser, Val/lle, Asp/Glu, Thr/Ser. 
Ala/Gly, Ala/Thr, Ser/Asn. Ala/Val, Ser/Gly. Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/lie, 
LeuA/al, Ala/Glu, and Asp/Gly as well as these in reverse. 

In an interesting embodiment of the invention, the amino add changes are of such a 
nature that the physico-chemical properties of the polypeptides are altered. For example. 
20 amino add changes may be performed, which improve the thermal stability of the polypep- 
tide, which alter the substrate spedficity, which changes the pH optimum, and the like. 

Particulariy, the number of such substitutions, deletions and/or insertions in the xy- 
lanase of the invention, particulariy those of the mature xylanases of SEQ ID NO:2, SEQ ID 
NO:4 or SEQ ID NO:6 to produce an artificial variant is at the most 10, such as at the most 9, 
25 e.g. at the most 8. more preferably at the most 7, e.g. at the most 6, such as at the most 5, 
most preferably at the most 4. e.g. at the most 3, such as at the most 2, in particular at the 
most 1. 

In a particular embodiment the artifidal variant is a variant, which has an altered, 
preferably reduced, immunogenicity, especially allergenidty, in animals induding man as 

30 compared to a parent xylanase. The term "immunogenicity" in this context is to be under- 
stood as the artificial variant invoking a an altered, in particular reduced, immunological re- 
sponse when administered to an animal, including intravenous, cutaneous, subcutaneous, 
oral and intratracheal administration. The term "immunological response" in this context 
means that the administration of the artifidal variant causes an alteration in the immunoglo- 

35 bolin levels in the animal body, such as in IgE, IgG and IgM or an alteration in the cytokine 
level in the animal body. Methods for mapping immunogenic/antigenic epitopes of a protein, 
preparing variants with altered immunogenidty and methods for measuring an immunological 



11 



response is well known to the art and are described e.g. in WO 92/10755, WO 00/26230, 
WO 00/26354 and WO 01/31989. 

In a further embodiment, the present invention relates to GH-61 xylanases which are 
encoded by nucleotide sequences which hybridize under very low stringency conditions, par- 
ticulariy under low stringency conditions, more particulariy under medium stringency condi- 
tions, more particulariy under medium-high stringency conditions, even more particularly un- 
der high stringency conditions, and most particulariy under very high stringency conditions 
with a polynucleotide probe selected from the group consisting of 

(i) the complementary strand of nucleotides 52 to 699 of SEQ 10 NO:1, 46 to 957 of 

SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5, 
(il) the complementary strand of the cDNA sequence contained in nucleotides 52 to 699 

of SEQ ID NO:1 . 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 

(iii) the complementary strand of nucleotides 46 to 857 of SEQ ID NO:3, 

(iv) the complementary strand of nucleotides 52 to 300 of SEQ ID NO:1, 46 to 501 of 
SEQ ID NO:3 or 58 to 300 of SEQ ID NO:5 and/or 

(v) the complementary strand of nucleotides 301 to 699 of SEQ ID NO:1, 502 to 957 of 
SEQ ID NO:3 or 301 to 660 of SEQ ID NO:5. 

(J. SambFOok, E.F. Fritsch, and T. Maniatus, 1989, Molecular Cloning, A Laboratory Manual, 
2d edition. Cold Spring Harbor, New York). 

In particular, the xylanase of the invention is encoded by a polynucleotide comprising 
the nucleotide sequence of nucleotides 52 to 699 of SEQ ID NO:1, 46 to 957 of SEQ ID 
NO:3 or 58 to 660 of SEQ ID NO:5 or sequences differing from 52 to 699 of SEQ ID NO:1, 
46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 by virtue of the degeneracy of the 
genetic code. More particularly, the xylanase of the invention is encoded by a polynucleotide 
consisting of nucleotide sequence of nucleotides 52 to 699 of SEQ ID NO:1, 46 to 957 of 
SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 or sequences differing from 52 to 699 of SEQ ID 
NO: 1 , 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 by virtue of the degeneracy of 
the genetic code. 

The nucleotide sequences of nucleotides SEQ ID NO:1, SEQ ID NO:3 or of SEQ ID 
NO:5 or a subsequence thereof, as well as the amino acid sequences of SEQ ID NO:2. SEQ 
ID NO:4 or SEQ ID NO:6 or a fragment thereof, may be used to design a polynucleotide 
probe to identify and clone DNA encoding GH-61 xylanases of the invention from strains of 
different genera or species according to methods well known in the art. In particular, such 
prot>es can be used for hybridization with the genomic or cDNA of the genus or species of 
interest, following standard Southern blotting procedures, in order to identify and isolate the 
corresponding gene therein. Such probes can be considerably shorter than the entire se- 
quence, but should be at least 15, preferably at least 25, more preferably at least 35 nucleo- 
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tides in length, such as at ieast 70 nucleotides in length. It is, however, preferred that the 
polynucleotide probe is at least 100 nucleotides in length. For example, the polynucleotide 
probe may be at least 200 nucleotides in length, at least 300 nucleotides in length, at least 
400 nucleotides in length or at feast 500 nucleotides in length. Even longer probes may be 
5 used, e.g., polynucleotide probes which are at least 600 nucleotides in length, at least 700 
nucleotides in length, at least BOO nucleotides in length, or at least 900 nucleotides in length. 
Both DNA and RNA probes can be used. The probes are typically labeled for detecting the 
corresponding gene (for example, with ^^P, ^H, ^S, biotin, or avtdin). 

Thus, a genomic DNA or cDNA library prepared from such other organisms may be 

10 screened for DNA which hybridizes with the probes desolbed above and which encodes the 
xylanases of the invention. Genomic or other DNA from such other organisms may be sepa- 
rated by agarose or polyacrylamide gel electrophoresis, or other separation techniques. 
DNA from the libraries or the separated DNA may be transferred to. and immobilized, on ni- 
trocellulose or other suitable carrier materials. In order to identify a clone or DNA which has 

15 the required homology and/or identity or is homologous and/or Identical with of nucleotides 
52 to 699 of SEQ ID NO:1 . 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 the car- 
rier material with the immobilized DNA is used in a Southem blot. 

For purposes of the present invention, hybridization Indicates that the nucleotide se- 
quence hybridizes to a labeled polynucleotide probe which again hybridizes to the nucleotide 

20 sequence shown in nucleotides 52 to 699 of SEQ ID N0:1, 46 to 957 of SEQ ID NO:3 or 58 
to 660 of SEQ ID NO:5 under very low to very high stringency conditions. Molecules to 
which the polynucleotide probe hybridizes under these conditions may be detected using X- 
ray film or by any other method known in the art. Whenever the term "polynucleotide probe" 
is used in the present context, it is to be understood that such a probe contains at least 15 

25 nucleotides. 

In particular the probe comprise nucleotides encoding conserved portions of a Hisitidine (H) 
at position 1 , A or P at position 59, G at position 60, G at position 75. P or A at position 76. W 
or F at position 100, F or T at position 101. K or C at position 102, I or V or L at position 103, 
L or I or V or M at position 130. P at position 131, G and Xaa and Y at position 137-139, L or 

30 V or I or M at position 140, L or V or I or M at position 141, R at position 142, E or Q at 
positions 143-144, L or V or I at position 148, H or N at position 149. C at position 163 and P 
and G and P at position 209-21 1 in the mature xylanase. 

In an interesting embodiment the polynucleotide probe is the complementary strand 
of nucleotides 52 to 300 of SEQ ID NO:1, 46 to 501 of SEQ ID NO:3 or 58 to 300 of SEQ ID 

35 NO:5. In another embodiment the polynucleotide probe is the complementary strand of 
nucleotides 301 to 699 of SEQ ID NO:1. 502 to 957 of SEQ ID NO:3 or 301 to 660 of SEQ ID 
NO:5. 

13 
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In another interesting embodiment, the polynucleotide probe is the complementary 
strand of the nucleotide sequence which encx)des the polypeptide of SEQ ID NO:2, SEQ ID 
N0:4 or SEQ ID NO:6 or the mature xytanases thereof. In a further interesting embodiment, 
the polynucleotide probe Is the complementary strand of SEQ ID N0:1, SEQ ID N0:3 or 
5 SEQ ID NO:5. In a still further interesting embodiment, the polynucleotide probe is the com- 
plementary strand of the mature polypeptide coding region of SEQ ID NO:1, SEQ ID NO:3 or 
SEQ ID NO:5. 

For long probes of at least 100 nucleotides in length, very low to very high stringency 
conditions are defined as prehybridization and hybridization at 42*^0 in 5X SSPE, 1 .0% SDS, 

10 5X Denhardt's solution, 100 \igim\ sheared and denatured salmon sperm DMA, following 
standard Southern blotting procedures. Preferably, the long probes of at least 100 nucleo- 
tides do not contain more than 1000 nucleotides. For long probes of at least 100 nucleotides 
in length, the carrier material is finally washed three times each for 15 minutes using 2 x 
SSC, 0.1% SDS at 42**C (very low stringency), preferably washed three times each for 15 

15 minutes using 0.5 x SSC. 0.1% SDS at 42**C (low stringency), more preferably washed three 
times each for 15 minutes using 0.2 x SSC. 0.1% SDS at 42''C (medium stringency), even 
more preferably washed three times each for 15 minutes using 0.2 x SSC, 0.1% SOS at 
SS^'C (medium-high stringency), most preferably washed three times each for 15 minutes us- 
ing 0.1 X SSC. 0.1% SDS at OO^'C (high stringency), in particular washed three times each for 

20 15 minutes using 0.1 x SSC, 0.1% SDS at 68'C (very high stringency). 

Although not particularly preferred, it is contemplated that shorter probes, e.g. probes 
which are from about 15 to 99 nucleotides in length, such as from about 15 to about 70 
nucleotides in length, may be also be used. For such short probes, stringency conditions are 
defined as prehybridization. hybridization, and washing post-hybridization at 5''C to lO^'C be- 

25 low the calculated Tm using the calculation according to Bolton and McCarthy (1962. Pro- 
ceedings of the National Academy of Sciences USA 48:1390) in 0.9 M NaCI. 0.09 M Tris-HCI 
pH 7.6, 6 mM EDTA, 0.5% NP-40, IX Denhardt's solution, 1 mM sodium pyrophosphate, 1 
mM sodium monobasic phosphate, 0.1 mM ATP, and 0.2 mg of yeast RNA per ml following 
standard Southern blotting procedures. 

30 For short probes which are about 15 nucleotides to 99 nucleotides in length, the car- 

rier material is washed once in 6X SCC plus 0.1% SDS for 15 minutes and twice each for 15 
minutes using 6X SSC at 5^C to IC'C below the calculated Tm. 

Sources of xylanases 

The xylanase of the present Invention may be obtained from microorganisms of any genus. 
35 For purposes of the present invention, the term "obtained from", as used herein shall mean 
that the xylanase encoded by the nudeotide sequence is produced by a cell in which the 
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nucleotide sequence is naturally present or Into which the nucleotide sequence has been 
inserted. 

Further xylanases obtainable from microorganisms are in a particular embodiment an 
extracellular xylanases I.e. an xylanase which is secreted or otherwise exported from a 
5 microorganism to its surrounding medium. 

Eukarvotic sources 

The xylanase of the invention may be obtained from eukaryotes particularly, plant 
cells or fungi. Particularly, the xylanase may be derived from fungi that degrade carbohy- 
drates, such as cellulosic substrates. Such fungi include e.g. Asoomycota. Basidiomycota. 

10 Zygomycota or Oomycota. In particular Verticiilium tenerum, Coprinus cinerius, Diplodia 
gossypinna, Humicola insolens, DichotomocJadium hesseltlnel, Pseudoplectania nigrella, 
Psilocybe inquilina and Trichophaea saccata. 

Other relevant fungi may be yeasts such as a Candida, Kiuyveromyces, Pichia, Sac- 
charomyces, Schizosaccharomyces, or Yarrowla or filamentous fungi such as an Acremo- 

15 nium, Aspergillus, Aureobasidium. Cryptococcus, Filibasidium, Fusarium, Humicola, Mag- 
naporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, 
Piromyces, Schizophyllum, Talaromyces, Themoascus, Thielavia, Totypocladium, or Tricho- 
derma. 

In an interesting embodiment, the xylanase is derived from Saccharomyces cadsber- 
20 gensis, Saccharomyces cerevisiae, Saccharomyces dlastaticus, Saccharomyces douglasii, 
Saccharomyces kluyveri, Saccharomyces norber^sis or Saccharomyces oviformts. 

In another interesting embodiment, the xylanase is derived from Aspergillus aculea- 
fus, Aspergillus awamon. Aspergillus foetidus. Aspergillus ;apon/cus. Aspergillus nidular}s, 
Aspergillus niger, Aspergillus oryzae, Fusarium bactridioldes, Fusarium cerealls, Fusarium 
25 crookwellertse, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium 
heterosporum, Fusarium negur\dt, Fusarium oxysporum, Fusarium reHculatum, Fusarium 
roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusa- 
rium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, 
Humicola insolens, Humicola lanuginosa, Mucormiehei, Myceliophthora thermophila, Neuro- 
' 30 spora crassa, Penicillium purpurogenum, Trichoderma harzianum, Trichoderma koningii, 
Trichoderma longibrachiaium, Trichoderma reesei, or Trichoderma viride. 

it will be understood that for the aforementioned species, the invention encompasses 
both the perfect and imperfect states, and other taxonomic equivalents, e.g., anamorphs, re- 
gardless of the species name by which they are known. Those skilled in the art will readily 
35 recognize the identity of appropriate equivalents. 

Strains of these species are readily accessible to the public in a number of culture 
collections, such as the American Type Culture Collection (ATCC), Deutsche Sammlung von 
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Mikroorganismen und Zellkulturen GmbH (DSM), Centraalbureau Voor Schimmelcultures 
(CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional Re- 
search Center (NRRL). 

Furthermore, such xylanases may be identified and obtained from other sources 
5 including microorganisms isolated from nature (e.g., soil, composts, water, etc.) using the 
above-mentioned probes. Techniques for isolating microorganisms from natural habitats are 
well known in the art. The nucleotide sequence may then be derived by similarly screening a 
genomic or cDNA library of another microorganism. Once a nucleotide sequence encoding a 
xylanase has been detected with the probe(s), the sequence may be isolated or cloned by 
10 utilizing techniques which are known to those of ordinary skill in the art (see, e.g.. Sambrook 
ef a/., 1989, supra). 

Xylanases encoded by nucleotide sequences of the present invention also Include 
fused xylanases or cleavable fusion xylanases In which another xylanase is fused at the N- 
terminus or the C-terminus of the xylanase or fragment thereof. A fused xylanase is pro- 
15 duced by fusing a nucleotide sequence (or a portion thereof) encoding another xylanase to a 
nucleotide sequence (or a portion thereof) of the present invention. Techniques for produc- 
ing fusion xylanases are known in the art, and Include ligating the coding sequences encod- 
ing the xylanases so that they are in frame and that expression of the fused xylanase is un- 
der control of the same promoter(s) and terminator. 

20 Nucleotide sequences 

The present invention also relates to polynucleotides comprising a nucleotide se- 
quence, which encodes a GH-61 xylanase. In particular, the present invention relates to 
polynucleotides consisting of a nucleotide sequence, which encodes a xylanase of the inven- 
tion. In a particular embodiment, the nucleotide sequence Is set forth In SEQ ID NO:1, SEQ 
25 ID NO:3 or SEQ ID NO:5. In a more particular emtxxliment, the nucleotide sequence is the 
mature xylanase coding region of SEQ ID NO:1 , SEQ ID NO:3 or SEQ ID NO:5. 

The present Invention also encompasses polynucleotides comprising, preferably 
comprising or consisting of, nucleotide sequences which encode a xylanase consisting of the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. or the mature xy- 
30 lanase thereof including nucleotide sequences differing from SEQ ID NO:1, SEQ ID NO:3 or 
SEQ ID NO:5 by virtue of the degeneracy of the genetic code. 

The present invention also relates to polynucleotides comprising, preferably consist- 
ing of, a subsequence of: 

SEQ ID NO:1 which encode fragments of SEQ ID NO:2 that exhibit xylanase activity 
35 and contains the conserved portions. 

SEQ ID NO:3 which encode fragments of SEQ ID NO:4 Wiat exhibit xylanase activity 

and contains the conserved portions. 
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SEQ ID NO:5 which encode fragments of SEQ ID NO:6 that exhibit xylanase activity 
and contains the conserved portions. 

A subsequence of SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:5 is a nucleotide sequence 
5 encompassed by SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:5 except that one or more nu- 
cleotides from the 5* and/or 3* end have been deleted. 

The present invention also relates to polynucleotides comprising, preferably consisting of, a 
modified nucleotide sequence which comprises at least one modification/mutation in the 
mature xylanase coding sequence of SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:5. and 

10 where the modified nucleotide sequence encodes a xylanase comprising a H at position 1, A 
or P at position 59, G at position 60, G at position 75. P or A at position 76, W or F at position 
100, F or T at position 101, K or C at position 102, 1 or V or L at position 103. L or I or V or M 
at position 130, P at position 131, G and Xaa and Y at position 137-139, L or V or I or M at 
position 140, L or V or I or M at position 141. R at position 142, E or Q at positions 143-144. 

15 L or V or I at position 148, H or N at position 149, C at position 163 and P and G and P at 
position 209-211. 

The techniques used to isolate and/or clone a nucleotide sequence encoding an xy- 
lanase are known in the art and include isolation from genomic DNA, preparation from cDNA, 

20 or a combination thereof. The cloning of the nucleotide sequences of the present invention 
from such genomic DNA can be effected, e.g., by using the well known polymerase chain 
reaction (PCR) or antibody screening of expression libraries to detect cloned DNA fragments 
with shared structural features. See, e.g., Innis et aL. 1990, PCR: A Guide to Methods and 
Application, Academic Press, New York. Other amplification procedures such as ligase 

25 chain reaction (LCR), ligated activated tiBnscription (LAT) and nucleotide sequence-based 
amplification (NASBA) may be used. The nucleotide sequence may be cloned from a strain 
of Humicola or Coprinus, or another or related organism and thus, for example, may be an 
allelic or spedes variant of the xylanase encoding region of the nucleotide sequence. 

The nucleotide sequence may be obtained by standard cloning procedures used in 

30 genetic engineering to relocate the nucleotide sequence from its natural location to a differ- 
ent site where it will be reproduced. The cloning procedures may involve excision and isola- 
tion of a desired fragment comprising the nucleotide sequence encoding the xylanase, inser- 
tion of the fragment into a vector molecule, and incorporation of the recombinant vector into a 
host cell where multiple copies or clones of the nucleotide sequence will be replicated. The 

35 nucleotide sequence may be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or any 
combinations thereof. 

The present invention also relates to a polynucleotide having, preferably consisting 
of, a nucleotide sequence which has at least 65% identity with nucleotides 52 to 699 of SEQ 
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10 NO:1, 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5. Particularly, the nudeotide 
sequence has at least 70% identity, e.g. at least 80% identity, such as at least 90% Identity, 
more preferably at least 95% identity, such as at least 96% Identity, e.g. at least 97% identity, 
even more preferably at least 98% identity, such as at least 99% with nucleotides 52 to 699 
5 of SEQ ID NO:1 , 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5. Preferably, the nu- 
deotide sequence encodes a xylanase. The degree of identity between two nucleotide se- 
quences is determined as described previously (see the section entitled "Definitions"). Par- 
ticularly, the nudeotide sequence comprises nudeotides 52 to 699 of SEQ ID NO:1, 46 to 
957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5. In an even more particular embodiment, 

10 the nudeotide sequence consists of nudeotides 52 to 699 of SEQ ID NO:1. 46 to 957 of 
SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5. 

Modification of a nucleotide sequence encoding an xylanase of the present invention 
may be necessary for the synthesis of an xylanase, which comprises an amino acid se- 
quence that has at least one substitution, deletion and/or insertion as compared to amino ac- 

15 ids 18-233 of SEQ ID NO:2, 16-319 of SEQ ID NO:4 or 20-220 of SEQ ID NO:6. These artifi- 
dal variants may differ in properties or other way from the xylanase isolated from its native 
source, e.g., variants may differ in cart>ohydFate adivation capabilities, carbohydrate specific- 
ity, thermostability, pH optimum or the like. 

It will be apparent to those skilled In the art that such modifications can be made to 

20 preserve the membership of the xylanase to the GH-61 homology group and can be made 
outside regions critical to the function of the molecule regions and still result in a xylanase. 
Amino acid residues essential to the function or the GH-61 characteristics of the xylanase 
encoded by the nucleotide sequence of the invention are therefore preferably not subject to 
modification, such as substitution. Amino acid residues essential to the function may be Iden- 

25 tified according to procedures known In the art. such as site-direded mutagenesis or alanine- 
scanning mutagenesis (see, e.g., Cunningham and Wells, 1989, Science 244: 1081-1085). 
In the latter technique, mutations are introduced at every positively charged residue in the 
molecule, and the resultant mutant molecules are tested for the carbohydrate activation func- 
tion to identify amino acid residues that are critical to the fundion of the molecule. Sites of 

30 carbohydrate-xylanase interadion can also be determined by analysis of the three- 
dimensional strudure as determined by such techniques as nuclear magnetic resonance 
analysis, crystallography or photoaffinlty lat>eiling (see, e.g., de Vos et a/., 1992. Science 
255: 306-312; Smith et ai„ 1992. Journal of Molecular Bioiogy 224: 899-904; Wlodaver et a/.. 
1992. FEBS Letters 309: 59-64). 

35 Moreover, a nucleotide sequence encoding a xylanase of the present invention may 

be modified by introduction of nudeotide substitutions which do not give rise to another 
amino add sequence of the xylanase encoded by the nucleotide sequence, but which corre- 
spond to the codon usage of the host organism intended for produdion of the xylanase. 
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The introduction of a mutation into the nucleotide sequence to exchange one nucleo- 
tide for another nucleotide may be accomplished by site-directed mutagenesis using any of 
the methods known in the art. Particularly useful is the procedure, which utilizes a super- 
coiled, double stranded DNA vector with an insert of interest and two synthetic primers con- 
5 talning the desired mutation. The oligonucleotide primers, each complementary to opposite 
strands of the vector, extend during temperature cycling by means of Pfu DNA polymerase. 
On incorporation of the primers, a mutated plasmid containing staggered nicks is generated. 
Following temperature cycling, the product is treated with Dpn\ which is specific for methy- 
lated and hemimethylated DNA to digest the parental DNA template and to select for muta- 
10 tion-containing synthesized DNA. Other procedures known in the art may also be used. For 
a general description of nucleotide substitution, see, e.g.. Ford et a/., 1991, Protein Expres- 
sion and Purification 2: 95-107. 

The present invention also relates to a polynucleotide having, preferably consisting of, a nu- 
cleotide sequence which encodes a GH-61 xylanase and which hybridizes under very low 
15 stringency conditions, preferably under low stringency conditions, more preferably under me- 
dium stringency conditions, more preferably under medium-high stringency conditions, even 
more preferably under high stringency conditions, and most preferably under very high strin- 
gency conditions with a polynucleotide probe selected from the group consisting of: 

20 (i) the complementary strand of nucleotides 52 to 699 of SEQ ID N0:1, 46 to 957 of 
SEQ ID NO:3 or 58 to 660 of SEQ ID N0:5. 

(ii) the complementary strand of the cDNA sequence contained in nucleotides 52 to 699 
of SEQ ID NO:1 . 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 

(iii) the complementary strand of nucleotides 46 to 857 of SEQ ID NO:3, 

25 (iv) the complementary strand of nucleotides 52 to 300 of SEQ ID NO:1, 46 to 501 of 
SEQ ID NO:3 or 58 to 300 of SEQ ID NO:5, 
(v) the complementary strand of nucleotides 301 to 699 of SEQ ID NO:1, 502 to 957 of 
SEQ ID NO:3 or 301 to 660 of SEQ ID NO:5, 

30 (J. Sambrook. E.F. Fritsch, and T. Maniatus, 1989, Molecular Cloning. A Laboratory Manual. 
2d edition, Cold Spring Harbor, New Yortc). 



As will be understood, details and particulars concerning hybridization of the nucleotide se- 
quences will be the same or analogous to the hybridization aspects discussed in the section 
titled "GH-61 xylanases" herein. 
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The present invention also encompass a storage medium, such as a computer disk, 
containing in digital form or in any other form the sequences as mentioned herein. 

Nucleic acid construct comprising nucleotide sequences 

The present Invention also relates to nucleic acid constructs comprising a nucleotide se- 
5 quence of the invention operably linked to one or more control sequences that direct the ex- 
pression of the coding sequence in a suitable host cell under conditions compatible with the 
control sequences. 

A nucleotide sequence encoding a xylanase of the invention may be manipulated in a 
variety of ways to provide for expression of the xylanase. Manipulation of the nucleotide se- 

10 quence prior to its insertion into a vector may be desirable or necessary depending on the 
expression vector. The techniques for modifying nucleotide sequences utilizing recombinant 
DNA methods are well known in the art. 

The control sequence may be an appropriate promoter sequence, a nucleotide se- 
quence which Is recognized by a host cell for expression of the nucleotide sequence. The 

15 promoter sequence contains transcriptional control sequences, which mediate the expres- 
sion of the polypeptide. The promoter may be any nucleotide sequence which shows tran- 
scriptional activity in the host cell of choice including mutant, truncated, and hybrid promot- 
ers, and may be obtained from genes encoding extracellular or intracellular polypeptides ei- 
ther homologous or heterologous to the host cell. 

20 Examples of suitable promoters for directing the transcription of the nucleic add con- 

structs of the present invention, especially in a bacterial host cell, are the promoters obtained 
from the E. coU lac operon, Streptomyces coelicolor agarase gene (dagA), Bacillus subtilis 
levansucrase gene (sacB), Bacillus llchenifomis alpha-amylase gene {amyL), Badllus 
stearothermophilus maltogenic amylase gene (amyM), Bacillus amyioliquefaciens alpha- 

25 amylase gene (amyO), Bacillus licheniformis penicillinase gene (penP), Bacillus subtilis xylA 
and xylB genes, and prokaryotic beta-lactamase gene (Villa-Kamaroff et al., 1978, Proceed- 
ings of the National Academy of Sciences USA 75: 3727-3731), as well as the tac promoter 
(DeBoer et ai, 1983. Proceedings of the National Academy of Sciences USA 80: 21-25). 
Further promoters are described in "Useful proteins from recombinant bacteria" in Scientific 

30 American, 1980, 242: 74-94; and In Sambrook et al., 1989, supra. 

Examples of suitable promoters for directing the transcription of the nucleic add con- 
structs of the present invention in a filamentous fungal host cell are proniK>ters obtained from 
the genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, 
Aspergillus niger neutral alpha-amylase, Aspergillus n/ger acid stable alpha-amylase, Asper- 

35 gillus niger or Aspergillus awamori glucoamylase {glaA), Rhizomucor miehei lipase, Aspergil- 
lus oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase. Aspergillus 
nidulans acetamidase, and Fusarium oxysporum trypsln-like protease (WO 96/00787), as 

20 
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well as the NA2-tpl pronnoter (a hybrid of the promoters from the genes for Aspergillus niger 
neutral alpha-amylase and AspergUlus oryzae those phosphate isomerase), and mutant, 
truncated, and hybrid promoters thereof. 

In a yeast host, useful promoters are obtained from the genes for Saccharomyces 
5 cerevislae enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GAL1), Saccharo- 
myces cerevislae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase 
{ADH2/GAP), and Saccharomyces cerevisiae 3-phosphoglycerate kinase. Other useful pro- 
moters for yeast host cells are described by Romanos aL, 1992, Yeast 8: 423-488. 

The control sequence may also be a suitable transcription terminator sequence, a 
10 sequence recognized by a host cell to terminate transcription. The terminator sequence is 
operably linked to the 3' temiinus of the nucleotide sequence encoding the xylanase. Any 
terminator which Is functional in the host cell of choice may be used in the present invention. 

Preferred temninators for filamentous fungal host cells are obtained from the genes 
for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidularys 
1 5 anthranilate synthase, Aspergillus niger alpha-glucosidase. and Fusarium oxysporum trypsln- 
like protease. 

Prefenred terminators for yeast host cells are obtained from the genes for Saccharo- 
myces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Sac- 
charomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful termina- 
20 tors for yeast host cells are described by Romanos et a/., 1992, supra. 

The control sequence may also be a suitable leader sequence, a non-translated re- 
gion of an mRNA which is important for translation by the host ceil. The leader sequence is 
operably linked to the 5' tenminus of the nucleotide sequence encoding the polypeptide. Any 
leader sequence that is functional in the host cell of choice may be used in the present inven- 
25 tion. Preferred leaders for filamentous fungal host cells are obtained from the genes for 
Aspergillus oryzae TAKA amylase and Aspergillus nidulans those phosphate Isomerase. 

Suitable leaders for yeast host cells are obtained from the genes for Saccharomyces 
cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase. Sac- 
charomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydro- 
30 genase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP). 

The control sequence may also be a polyadenylation sequence, a sequence operably 
linked to the 3' terminus of the nucleotide sequence and which, when transcribed, is recog- 
nized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any 
polyadenylation sequence which is functional In the host cell of choice may be used in the 
35 present invention. 

Preferred polyadenylation sequences for filamentous fungal host cells are obtained 
from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase. As- 
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pergillus nidulans anthranilate synthase, Fusarium oxysporum trypsin-like protease, and As- 
pergiilus niger alpha-glucosidase. 

Useful polyadenylation sequences for yeast host oelJs are described by Guo and 
Sherman, 1 995, Molecular Cellular Biology 1 5: 5983-5990. 
5 The control sequence may also be a signal peptide coding region that codes for an 

amino acid sequence linked to the amino terminus of a polypeptide and directs the encoded 
xylanase into the cell's secretory pathway. The 5* end of the coding sequence of the nucleo- 
tide sequence may Inherently contain a signal peptide coding region naturally linked in trans- 
lation reading frame with the segment of the coding region which encodes the secreted xy- 
10 ianase. Alternatively, the 5' end of the coding sequence may contain a signal peptide coding 
region which is foreign to the coding sequence. The foreign signal peptide coding region 
may be required where the coding sequence does not naturally contain a signal peptide cod- 
ing region. Alternatively, the foreign signal peptide coding region may simply replace the 
natural signal peptide coding region in order to enhance secretion of the xylanase. However, 
15 any signal peptide coding region which directs the expressed xylanase into the secretory 
pathway of a host cell of choice may be used in the present invention. 
The signal peptide coding region is nucleotides 1 to 51 of SEQ ID NO:1, 1 to 51 of SEQ ID 
NO:3 or 1 to 57 of SEQ ID NO:5 which encode amino ackis 18 to 233 of SEQ ID NO:2. 16 to 
319 of SEQ ID NO:4 or 20 to 220 of SEQ ID NO:6. 
20 Effective signal peptide coding regions for bacterial host cells are the signal peptide 

coding regions obtained from the genes for Bacillus NCIB 11837 maltogenic amylase, Bacil- 
lus stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin. Bacillus licheniformis 
beta-lactamase, Bacillus stearothermophilus neutral proteases (nprT, nprS, nprM), and Bacil- 
lus subtilis prsA. Further signal peptides are described by Simonen and Palva, 1993, Micro- 
is biological Reviews 57: 1 09-1 37. 

Effective signal peptide coding regions for filamentous fungal host cells are the signal 
peptide coding regions obtained from the genes for Aspergillus oryzae TAKA amylase, As- 
pergillus niger neutral amylase, Aspergillus n/ger glucoamylase, Rhizomucor miehei aspartlc 
proteinase, Humicola insolens cellulase, and Humicola lanuginosa lipase. 
30 Useful signal peptides for yeast host cells are obtained from the genes for Saccharo- 

myces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal 
peptide coding regions are described by Romanes etai, 1992, supra. 

The control sequence may also be a propeptide coding region that codes for an 
amino acid sequence positioned at the amino terminus of a xylanase. The resultant polypep- 
35 tide may be denoted a pro-xylanase or propolypeptide. A propolypeptide is generally inac- 
tive and can be converted to a mature active polypeptide by catalytic or autocatalytic cleav- 
age of the propeptide from the propolypeptide. The propeptide coding region may be ob- 
tained from the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral 
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protease {nprT), Saccharomyces cerevisiae alpha-factor, Rhizomucor miehei aspartic pro- 
teinase, and Myceiiophthom thermophila laccase (WO 95/33836). 

Where both signal peptide and propeptide regions are present at the amino terminus 
of a polypeptide, the propeptide region is positioned next to the amino terminus of a polypep- 
5 tide and the signal peptide region is positioned next to the amino terminus of the propeptide 
region. 

In yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the 
TAKA alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and Aspergillus 
oryzae glucoamylase promoter may be used as regulatory sequences. Other examples of 
10 regulatory sequences are those which allow for gene amplification. In eukaryotic systems, 
these include the dihydrofolate reductase gene which is amplified in the presence of meth- 
otrexate, and the metallothionein genes which are amplified with heavy metals. In these 
cases, the nucleotide sequence encoding the polypeptide would be operably linked with the 
regulatory sequence. 

1 5 Recombinant expression vector comprising nucleic acid construct 

The present invention also relates to recombinant expression vectors comprising the nucleic 
acid construct of the invention. The various nucleotkie and control sequences described 
above may be joined together to produce a recombinant expression vector, which may in- 
clude one or more convenient restriction sites to allow for insertion or substitution of the nu- 

20 cleotide sequence encoding the polypeptide at such sites. Altematively, the nucleotide se- 
quence of the present invention may be expressed by inserting the nucleotide sequence or a 
nucleic acid construct comprising the sequence into an appropriate vector for expression. In 
creating the expression vector, the coding sequence is located in the vector so that the cod- 
ing sequence Is operably linked with the appropriate control sequences for expression. 

25 The recombinant expression vector may be any vector (e.g., a plasmid or virus) which 

can be conveniently subjected to recombinant DNA procedures and can bring about the ex- 
pression of the nucleotide sequence. The choice of the vector will typically depend on the 
compatibility of the vector with the host cell into which the vector is to be introduced. The 
vectors may be linear or dosed circular piasmids. 

30 The vector may be an autonomously replicating vector, /.e., a vector which exists as 

an extrachromosomal entity, the replication of which is independent of chromosomal replica- 
tion, e.g., a plasmid. an extrachromosomal element, a minichromosome, or an artificial 
chromosome. 

The vector may contain any means for assuring self-replication. Altematively, the 
35 vector may be one which, when Introduced into the host cell, is integrated into the genome 
and replicated together with the chromosome(s) into which it has been integrated. Further- 
more, a single vector or plasmid or two or more vectors or plasmids which together contain 
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the total DNA to be introduced into the genome of the host cell, or a transposon may be 
used. 

The vectors of the present Invention preferably contain one or more selectable mark- 
ers which permit easy selection of transfomned cells. A selectable marker is a gene the 
5 product of which provides for biocide or viral resistance, resistance to heavy metals, proto- 
trophy to auxotrophs, and the like. 

ExampJes of bacterial selectable markers are the dat genes from Bacillus subtifis or 
Bacillus licheniformis, or markers which confer antibiotic resistance such as ampicillin. 
kanamycin, chloramphenicol or tetracycline resistance. Suitable markers for yeast host cells 

10 are ADE2. HISS, LEU2. LYS2, MET3, TRP1. and URA3. Selectable markers for use In a fil- 
amentous fungal host cell include, but are not limited to, amdS (acetamidase), argB (or- 
nithine carbamoyltransferase), bar (phosphinothricin acetyltransferase), hygB (hygromydn 
phosphotransferase). niaD (nitrate reductase), pyrG (orotidine-5'-phosphate decarboxylase), 
sC (sulfate adenyltransferase), trpC (anthranfiate synthase), as well as equivalents thereof. 

15 Preferred for use in an Aspergillus cell are the amdS and pyrG genes of Aspergillus 

nidulans or Aspergillus oryzae and the bar gene of Streptomyces hygroscopicus. 

The vectors of the present Invention preferably contain an element(s) that penmits 
stable integration of the vector into the host cell's genome or autonomous replication of the 
vector in the cell independent of the genome. 

20 For integration into the host cell genome, the vector may rely on the nucleotide se- 

quence encoding the polypeptide or any other element of the vector for stable integration of 
the vector into the genome by homologous or nonhomologous recombination. Alternatively, 
the vector may contain additional nucleotide sequences for directing integration by homolo- 
gous recombination into the genome of the host cell. The additional nucleotide sequences 

25 enable the vector to be integrated into the host cell genome at a precise k3cation(s) in the 
chromosome(s). To increase the likelihood of integration at a precise location, the integra- 
tlonal elements should preferably contain a sufficient number of nucleotides, such as 100 to 
1,500 base pairs, preferably 400 to 1.500 base pairs, and most preferably 800 to 1,500 base 
pairs, which are highly homologous with the corresponding target sequence to enhance the 

30 probability of homologous recombination. The integratlonal elements may be any sequence 
that is homologous with the target sequence In the genome of the host cell. Furthermore, the 
integratlonal elements may be non-encoding or encoding nucleotide sequences. On the 
other hand, the vector may be integrated into the genome of the host cell by non- 
homologous recombination. 

35 For autonomous replication, the vector may further comprise an origin of replication 

enabling the vector to replicate autonomously In the host cell in question. Examples of bac- 
terial origins of replication are the origins of replication of plasmids pBR322, pUC19, pA- 
CYC177, and pACYC184 permitting replication in E. coll, and pUBIIO. pE194, pTA1060. 
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and pAMS1 permitting replication in Bacillus. Examples of origins of replication for use in a 
yeast host cell are the 2 micron origin of replication, ARS1, ARS4, the combination of ARS1 
and CEN3, and the combination of ARS4 and CEN6. The origin of replication may be one 
having a mutation which makes its functioning temperature-sensitive in the host cell (see, 
e.g., Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75: 1433). 

More than one copy of a nucleotide sequence of the present invention may be in- 
serted into the host celt to increase production of the gene product. An increase in the copy 
number of the nucleotide sequence can be obtained by Integrating at least one additional 
copy of the sequence into the host cell genome or by including an amplifiable selectable 
marker gene with the nucleotide sequence where cells containing amplified copies of the se* 
lectable marker gene, and thereby additional copies of the nucleotide sequence, can be se- 
lected for by cultivating the cells in the presence of the appropriate selectable agent. 

The procedures used to ligate the elements described above to construct the recom- 
binant expression vectors of the present Invention are well known to one skilled in the art 
(see, e.g., Sambrook ef a/., 1989, supra). 

Recombinant host cell comprising nucleic acid construct 

The present Invention also relates to recombinant a host cell comprising the nucleic add con- 
struct of the invention, which are advantageously used in the recombinant production of the 
polypeptides. A vector comprising a nucleotide sequence of the present invention is in- 
20 troduced into a host cell so that the vector is maintained as a chromosomal integrant or as a 
self-replica ting extra-chromosomal vector as described eariier. 

The host cell may be a unicellular microorganism, e.g., a prokaryote or a non- 
unicellular microorganism, e.g.. a eukaryote. 

Useful unicellular cells are t>acterial cells such as gram positive bacteria including, but 
25 not limited to, a Bacillus cell, e.g., Bacillus alkalophilus. Bacillus amyloliquefaciens. Bacillus 
brevis. Bacillus circulans. Bacillus clausii. Bacillus coagulans. Bacillus lautus. Bacillus lentus. 
Bacillus lichenifonnis. Bacillus megaterium. Bacillus stearothennophilus. Bacillus subtilis, and 
Bacillus thuringiensis', or a Streptomyces cell, e.g., Streptomyces lividans or Streptomyces 
murinus, or gram negative bacteria such as E. coli and Pseudomonas sp. In a preferred em- 
30 bodiment, the bacterial host cell is a Bacillus lentus. Bacillus lichenifonnis. Bacillus 
stearotherwophilus. or Bacillus subtilis cell. In another preferred embodiment, the Bacillus 
cell is an alkalophilic Bacillus. (KKSC/SALK verify relevans for GH-61 xylanases) 

The introduction of a vector into a bacterial host cell may, for Instance, be effected by 
protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 
35 168: 111-115). using competent cells (see. e.g., Young and Splzizin. 1961, Journal of Bacte- 
riology 81 : 823-829, or Dubnau and Davidoff-Abelson, 1971. Journal of Molecular Biology 56: 
209-221), electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742- 

25 
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751), or conjugation (see, e.g., Koehler and Thome, 1987, Journal of Bacteriology 169: 
5771-5278). 

The host cell may be a eukaryote, such as a mammalian, insect, plant, or fungal cell. 
In a preferred embodiment, the host cell is a fungai cell. "Fungi" as used herein in- 
5 dudes the phyla Ascomycota, Basidiomycota, Chytridiomyoota, and Zygomycota (as defined 
by Hawlcsworth et aL, In, Ainsworth and Bisby's Dictionary of The Fungi, 8th edition. 1995, 
CAB Intemational. University Press. Cambridge, UK) as well as the Oomycota (as cited In 
Hawksworth et aL, 1995. supra, page 171) and all mitosporic fungi (Hawksworth et ai, 1995, 
supra). In a more prefenred embodiment, the fungal host cell is a yeast cell. "Yeast" as used 

10 herein Includes asoosporogenous yeast (Endomycetales), basidiosporogenous yeast, and 
yeast belonging to the Fungi Imperfect! (Blastomycetes). Since the classification of yeast 
may change in the future, for the purposes of this invention, yeast shall be defined as de- 
scribed in Biology and Activities of Yeast (Skinner, F.A., Passmore, S.M., and Davenport, 
R.R., eds. Soc. App. Bacteriol. Symposium Series No. 9. 1980). 

15 In an even more preferred embodiment, the yeast host cefl Is a Candida, Hansenula, 

Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell. 

In a most preferred embodiment, the yeast host cell is a Saccharomyces carisber- 
gensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, 
Saccharomyces kluyveri, Saccharomyces norbensis or Saccharomyces oviformis cell. In 

20 another most preferred embodiment, the yeast host cell is a Kluyveromyces lactis cell, in 
another most preferred embodiment, the yeast host cell Is a Yarrowia lipolytica cell. 

In another more preferred embodiment, the fungal host cell is a filamentous fungal 
cell. "Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oo- 
mycota (as defined by Hawksworth et aL, 1995, supra). The filamentous fungi are character- 

25 ized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan. and other 
complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism 
is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cere- 
visiae is by budding of a unicellular thallus and carbon catabolism may be fermentative. 

In an even more preferred embodiment, the filamentous fungal host cell Is a cell of a 

30 species of, but not limited to. Acremonium, Aspergillus, Fusarium, Humicola, Mucor, My- 
celiophthora, Neurospora, Penicillium, Thielavia, Tolypocladium, or Trichoderma, 

In a most preferred embodiment, the filamentous fungal host cell is an Aspergillus 
awamori, Aspergillus fbetldus, Aspergillus Japonicus, Aspergillus nklulans, Aspergillus niger 
or Aspergillus oryzae cell. In another most preferred embodiment, the filamentous fungal 

35 host cell is a Fusarium bactridloldes, Fusarium cerealis, Fusarium crookwellense, Fusarium 
culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium 
negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sam- 
bucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum. Fuse- 
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rium torulosum, Fusarium trichothecioides, or Fusarium venenatum cell. In an even most 
preferred embodiment, the filamentous fungal parent ceil is a Fusarium venenatum (Niren- 
berg sp. nov.) cell. In another most prefen-ed embodiment, the filamentous fungal host cell is 
a Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceiiophthora thenvophila, Neu- 
5 rospora crassa, Penicillium purpurogenum, Thielavia tenestris, Trichodemia harzianum, 
Trichodem^a koningii, Trichodemna longibrachiatum, Trichoderma reesei, or Trichodenna 
viride cell. 

Fungal cells may be transfonned by a process Involving protoplast formation, trans- 
fomiation of the protoplasts, and regeneration of the ceil wall in a manner known per se. 

10 Suitable procedures for transformation of Aspergilius host cells are described in EP 238 023 
and Yelton et aL, 1984. Proceedings of the Nationai Academy of Sciences USA 81: 1470- 
1474. Suitable methods for transforming Fusarium species are described by Malardier et al., 
1989, Gene 78: 147-156 and WO 96/00787. Yeast may be transformed using the proce- 
dures described by Becker and Guarente, In Abelson, J.N. and Simon, M.I., editors. Guide to 

15 Yeast Genetics and Molecular Biology, Methods in Enzymology, Volume 194, pp 182-187, 
Academic Press, Inc., New York; Ito etal., 1983, Journal of Bacteriology 153: 163; and Hin- 
nen etal., 1978, Proceedings of the National Academy of Sciences USA 75: 1920. 

Processes for preparing functional GH-61 xylanases 

The present invention also relates to methods for producing a xylanase of the present Inven- 

20 tion comprising (a) cultivating a strain, which in its wild-type form is capable of producing the 
xylanase; and (b) recovering the xylanase. Preferably, the strain is a fungus, more prefera- 
bly of the genus Humicola, particularly Humicola insolens or Coprinus, such as Coprinus 
cinereus or Thelavia such as Thelavia terrestris 

The present invention also relates to methods for producing a xylanase of the inven- 

25 tion comprising (a) cultivating a host cell under conditions conducive for production of the xy- 
lanase; and (b) recovering the xylanase. 

In the production methods of the present invention, the cells are cultivated in a nutri- 
ent medium suitable for production of the xylanase using methods known in the art. For ex- 
ample, the cell may be cultivated by shake flask cultivation, small-scale or large-scale fer- 

30 mentation (including continuous, batch, fed-batch, or solid state femientatlons) in laboratory 
or industrial femrientors perfonmed in a suitable medium and under conditions allowing the 
polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable nutrient 
medium comprising carbon and nitrogen sources and inorganic salts, using procedures 
known in the art. Suitable media are available from commerdal suppliers or may be pre- 

35 pared according to published compositions (e.g., in catalogues of the American Type Culture 
Collection). If the xylanase is secreted into the nutrient medium, the xylanase can be recov- 
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ered directly from the medium. If the xylanase is not secreted, it can be recovered from cell 
lysates. 

The xylanase may be detected using methods Icnown in the art and modifications 
thereof that are specific for the xylanase. These detection methods may include use of spe- 
5 cific antibodies, formation of a xylanase - carbohydrate complex, or disappearance of an ac- 
tivated carbohydrate sut)strate. For example, a carbohydrase assay may be used to deter- 
mine the activity of a carbohydrase on a carbohydrate with and without presence of xy- 
lanase. 

The resulting xylanase may be recovered by methods known in the art. For example, 
10 the xylanase may be recovered from the nutrient medium by conventional procedures includ- 
ing, but not limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or 
precipitation. 

The polypeptides of the present invention may be purified by a variety of procedures 
known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, 
15 hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., prepa- 
rative isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), SDS- 
PAGE, or extraction (see, e.g.. Protein Purificatton, J.-C. Janson and Lars Ryden, editors, 
VCH Publishers. New York, 1989). 

Transgenic plants 

20 The present invention also relates to a transgenic plant, plant part, or plant cell 

which has been transfonmed with a nucleotide sequence encoding a xylanase of the present 
invention so as to express and produce the xylanase. In one embodiment the plant could be 
used as host for production of xylanase in recoverable quantities. The xylanase may be 
recovered from the plant or plant part. Alternatively, the plant or plant part containing the 

25 recombinant xylanase may be used as such for improving the quality of a food or feed, e.g., 
improving nutritional value, palatability, and rheological properties, or to destroy an 
antinutritive factor. In particular the plant or plant parts expressing the xylanase may be used 
as an improved starting material for production of fuel-alcohols or bio-ethanol 

The transgenic plant can be dicotyledonous (a dicot) or monocotyledonous (a 

30 monocot). Examples of monocot plants are grasses, such as meadow grass (blue grass, 
Poa), forage grass such as festuca, loKum, temperate grass, such as Agrostis, and cereals, 
e.g., wheat, oats, rye, barley, rice, sorghum, and maize (com). 

Examples of dicot plants are tobacco, legumes, such as lupins, potato, sugar beet, 
pea, bean and soybean, and cruciferous plants (family Brassicaceae), such as cauliflower. 

35 rape seed, and the closely related model organism Arabidopsis thaliana. 

Examples of plant parts are stem, callus, leaves, root, fruits, seeds, and tubers. 
Also speciflc plant tissues, such as chloroplast, apoplast, mitochondria, vacuole. 
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peroxisomes, and cytoplasm are considered to be a plant part. Furthermore, any plant cell, 
whatever the tissue origin. Is considered to be a plant part. 

Also included within the scope of the present invention are the progeny of such 
plants, plant parts and plant cells. 

The transgenic plant or plant cell expressing a xylanase of the present invention 
may be constructed In accordance with methods known in the art. Briefly, the plant or plant 
cell is constructed by incorporating one or more expression constructs encoding a xylanase 
of the present invention into the plant host genome and propagating the resulting modified 
plant or plant cell into a transgenic plant or plant cell. 

Convenientiy, the expression construct is a nucleic add construct which comprises a 
nucleotide sequence encoding a xylanase of the present invention operably linked with 
appropriate regulatory sequences required for expression of the nucleotide sequence in the 
plant or plant part of choice. Furthermore, the expression construct may comprise a 
selectable mariner useful for identifying host cells into which the expression construct has 
been integrated and DNA sequences necessary for introduction of the construct into the 
plant in question (the latter depends on tiie DNA introduction method to be used). 

The choice of regulatory sequences, such as promoter and tenminator sequences 
and optionally signal or transit sequences, is detemnined, for example, on tiie basis of when, 
where, and how the xylanase is desired to be expressed. For instance, the expression of the 
gene encoding a xylanase of the present invention may be constitutive or inducible, or may 
be developmental, stage or tissue specific, and the gene product may be targeted to a 
specific tissue or plant part such as seeds or leaves. Regulatory sequences are, for 
example, described by Tague etaL, 1988, Plant Physiology Q6: 506. 

For constitutive expression, the 35S-CaMV promoter may be used (Franck et aL, 
1980, Cell 21: 285-294). Organ-specific promoters may be, for example, a promoter from 
storage sink tissues such as seeds, potato tubers, and fruits (Edwards & Coruzzi, 1990, Ann. 
Rev. Genet 24: 275-303), or from metabolic sink tissues such as meristems (Ito et al, 1994, 
Plant MoL Biol. 24: 863-878), a seed specific promoter such as the glutelin. prolamin, globu- 
lin, or albumin promoter from rice (Wu a/.. 1998, Plant and Cell Physiology 39: 885-889). a 
Vicia faba promoter from the legumin B4 and the unknown seed protein gene from Vicia faba 
(Conrad e^ a/.. 1998, Journal of Plant Physiology 152: 708-711), a promoter from a seed oil 
body protein (Chen et al., 1998, Plant and Cell Physiology 39: 935-941), tiie storage protein 
napA promoter from Brassica napus, or any other seed specific promoter known in the art, 
e.g., as described In WO 91/14772. Furthermore, the promoter may be a leaf specific pro- 
moter such as the rbcs promoter from rice or tomato (Kyozuka ef aL, 1993, Plant Physiology 
102: 991-1000, the chlorella virus adenine methyltransferase gene promoter (Mitra and Hig- 
gins. 1994, Plant Molecular Biology 26: 85-93), or the aldP gene promoter from rice (Kagaya 



29 



a/.. 1995, Molecular and General Genetics 248: 668-674). or a wound inducible promoter 
such as the potato pin2 promoter (Xu et aL, 1993, Plant Molecular Biology 22: 573-588). 

A promoter enhancer element may also be used to achieve higher expression of the 
xylanase in the plant. For instance, the promoter enhancer element may be an intron whidi 
is placed between the promoter and the nucleotide sequence encoding an xylanase of the 
present invention. For instance, Xu et aL, 1993, supra disclose the use of the first intron of 
the rice actin 1 gene to enhance expression. 

The selectable marker gene and any other parts of the expression construct may be 
chosen from those available in the art 

The nucleic acid construct is incorporated into the plant genome according to 
conventional techniques known in the art, including Agrobacterium-med'iaied transformation, 
virus-mediated transformation, microinjection, particle bombardment, biolistic transformation, 
and electroporation (Gasser et ai, 1990, Scfence 244: 1293; Potrykus, 1990, Bio/Technology 
8: 535; Shimamoto etaL, 1989. Nature 338: 274). 

Presently, Agrobacterium tumefaciens-medlated gene transfer is the method of 
choice for generating transgenic dicots (for a review, see Hooykas and Schilperoort, 1992. 
Plant Molecular Biology 19: 15-38). However it can also be used for transfonming monocots, 
although other transformation methods are generally preferred for these plants. Presently, 
the method of choice for generating transgenic monocots is particle bombardment 
(microscopic gold or tungsten particles coated with the transforming DNA) of embryonic calli 
or developing embryos (Christou. 1992. Plant Journal 2: 275-281; Shimamoto, 1994, Current 
Opinion Biotechnology 5: 158-162; Vasil et aL, 1992, Bio/Technology 10: 667-674). An 
altemative method for transformation of monocots is based on protoplast transformation as 
described by Omirulleh etaL, 1993, Plant Molecular Biology 2"^: 415-428. 

Following transfonmation, the transformants having incorporated therein the 
expression construct are selected and regenerated into whole plants according to methods 
vt«ll*known in the art. 

The present invention also relates to methods for producing an xylanase of the 
present invention comprising (a) cultivating a transgenic plant or a plant cell comprising a 
nucleotide sequence encoding an xylanase of the present invention under conditions 
conducive for production of the xylanase; and (b) recovering the xylanase. 

Compositions comprising functional GH-61 xylanases 

In a still further aspect, the present invention relates to compositions comprising a 
xylanase of the present invention and their preparation, in particular compositions wherein 
the xylanase of the invention is the major component of the composition, e.g.. a mono- 
component composition. 
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The composition may further comprise one or more enzymes. In particular carbohy- 
drases such as amylase, glucanase, xylanase, gaJactanase, mannanase etc. The enzymes 
may also include enzymes such as aminopeptJdase, carboxypeptidase, catalase, chitinase, 
cutinase, cyclodextrin glycosyltransferase. deoxyribonuclease, esterase, alpha- 
5 galactosldase, t>eta-galactosidase. glucoamylase, alpha-glucosidase, beta-glucosidase. 
haloperoxidase, invertase, laccase, lipase, mannosidase, oxidase, pectinolytlc enzymes, 
peptidoglutaminase, peroxidase, phytase, polyphenoloxidase. proteolytic enzyme, ribonucle- 
ase, transglutaminase, or xylanase. 

The compositions may be prepared in accordance with methods known in the art and 
10 may be in the form of a liquid or a solid composition. For Instance, the xylanase composition 
may be formulated using methods known to the art of formulating enzymes and/or 
pharmaceutical products, e.g. Into coated or uncoated granules or micro-granules. The 
xylanase to be included in the composition may be stabilized in accordance with methods 
known in the art e.g. by stabilizing the xylanase in the composition by adding and antioxidant 
15 or reducing agent to limit oxidation or the xylanase of it may be stabilized by adding polymers 
such as PVP, PVA, PEG or other suitable polymers known to be beneficial to the stability of 
polypeptides in solid or liquid compositions 

In a further embodiment the composition of the Invention Is a detergent composition 
which, in addition to the xylanase, comprises a surfactant and optionally compounds selected 
20 from the group consisting of builders such as zeolites, bleaching agents such as 
percarbonate, bleach enhancers such as TAED or NOBS, suds suppressors, fragrants, etc. 

In a further embodiment the composition of the invention is a feed composition which, 
in addition to the xylanase, comprises a cereal or grain product. 

In a further embodiment the composition of the invention is a baker flour composition 
25 comprising the xylanase of the invention. 

In a further embodiment the composition of the invention is fermentable composition, 
which in addition to the xylanase, comprises one or more nutrients for a microorganism 

In a further embodiment the composition of the invention is a pulping composition, 
which in addition to the xylanase, comprises pulp 

30 AppHcations of functional GH-61 xylanases 

In a still further aspect tiie invention relates to processes involving the xylanase of the inven- 
tion. The xylanases of the invention may suitably t>e applied in any process involving carbo- 
hydrates, particulariy hydrolysis of xylolosic substrates. 

In one embodiment the process of the invention is a cleaning process comprising 

35 contacting a cellulosic fabric to be cleaned with a xylanase of the invention. Such a process 
is particularly useful when the fabric is soiled with xylolosic soilings 
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In another embodiment the process of the invention Is a process for the preparation 
of food and/or feed composition comprising mixing a food/feed mixture with a xylanase of the 
invention. Xylanase treatment of animal feed and silage may release free pentose sugars or 
may reduce or modify the viscosity of xylan rich feed such as wheat. In particular the food is 
a bread product and the process further comprises baking a flour composition comprising the 
xylanase of the invention into a bread product. Xylanases are known to improve the quality of 
doughs. 

In yet another embodiment the process of the invention is a process for xylanase hy- 
drolysis of agricultural wastes for production of alcohol fuels 

In yet another embodiment the process of the invention is a brewing process wherein 
the presence of a xylanase may improve filterability of the wort. 

In yet another embodiment the process of the invention is a process for preparation of 
fruit or vegetable juices, wherein presence of a xylanase may improve filtration and increase 
yields. 

in yet another embodiment the process of the invention is a process for treatment of 
lignolosic materials and pulp with a xylanase, wherein the xylanase will dissolve xylolosic 
materials. 

The enzyme of the invention (produced by the method of the invention) may be provided in 
the form of a granulate, preferably a non-dusting granulate, a liquid, In particular a stabilized 
liquid, a slurry, or a protected enzyme. For certain applications, immobilization of the enzyme 
on a solid matrix may be preferred. 

DETERGENT DISCLOSURE AND EXAMPLES 

The xylanase of the invention may be added to and thus become a component of a 
detergent composition. 

The detergent composition of the invention may for example be formulated as a hand 
or machine laundry detergent composition including a laundry additive composition suitable 
for pre-treatment of stained fabrics and a rinse added fabric softener composition, or be for- 
mulated as a detergent composition for use in general household hard surface cleaning op- 
erations, or be formulated for hand or machine dishwashing operations. 

In a specific aspect, the invention provides a detergent additive comprising the en- 
zyme of the invention. The detergent additive as well as the detergent composition may com- 
prise one or more other enzymes such as a protease, a lipase, a cutinase. an amylase, a 
carbohydrase, a cellulase, a pectinase. a mannanase, an arabinase, a galactanase, a xy- 
lanase. an oxidase, e.g., a laccase, and/or a peroxidase. 
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In general the properties of the chosen enzyme(s) should be compatible with the se- 
lected detergent, (i.e. pH-optimum, compatibility with other enzymatic and non-enzymatic in- 
gredients, etc.), and the enzyme(s} should be present in effective amounts. 

Proteases: Suitable proteases include those of animal, vegetable or microbial origin. 
Microbial origin is preferred. Chemicaily modified or protein engineered mutants are included. 
The protease may t>e a serine protease or a metallo protease, preferably an alkaline micro- 
bial protease or a trypsin-like protease. Examples of alkaline proteases are subtilisins, espe- 
cially those derived from Bacillus, e.g., subtilisin Novo, subtilisin Carisberg, subtilisin 309, 
subtillsin 147 and subtilisin 168 (described in WO 89/06279). Examples of trypsin-like prote- 
ases are trypsin (e.g. of porcine or bovine origin) and the Fusarium protease described in 
WO 89/06270 and WO 94/25583. 

Examples of useful proteases are the variants described in WO 92/19729, WO 
98/20115, WO 98/201 16. and WO 98/34946, especially the variants with substitutions in one 
or more of the following positions: 27, 36, 57, 76, 87, 97. 101. 104, 120. 123, 167, 170, 194. 
206. 218. 222. 224, 235 and 274. 

Preferred commercially available protease enzymes include Alcalase®, Savinase®. 
Primase®. Duralase®, Esperase®, and Kannase® (Novozymes A/S), Maxatase®, Max- 
acal®, Maxapem®, Properase®, Purafecl®, Purafect OxP®, FN2®, and FN3® (Genencor 
International Inc.). 

Lipases: Suitable lipases include those of bacterial or fungal origin. Chemically 
modified or protein engineered mutants are included. Examples of useful lipases include li- 
pases from Humicola (synonym Thermomyces). e.g. from H. lanuginosa (T. lanuginosus) as 
described in EP 258 068 and EP 305 216 or from H. insolens as described in WO 96/13580. 
a Pseudomonas lipase, e.g. from P. alcaligenes or P. pseudoalcaligenes (EP 218 272). P. 
cepacia (EP 331 376), P. stutzeri (GB 1,372,034). P. fluorescens, Pseudomonas sp. strain 
SD 705 (WO 95/08720 and WO 96/27002), P. wisconsinensis (WO 96/12012), a Bacillus li- 
pase, e.g. from B. subtilis (Dartols et al. (1993), Biochemica et Biophysica Acta, 1131, 253- 
360), B. stearothermophilus (JP 64/744992) or B. pumilus (WO 91/16422). 

Other examples are lipase variants such as those described in WO 92/05249, WO 
94/01541. EP 407 225. EP 260 105. WO 95/35381. WO 96/00292, WO 95/30744, WO 
94/25578. WO 95/14783. WO 95/22615. WO 97/04079 and WO 97/07202. 

Preferred commercially available lipase enzymes include LipolaseTM and Lipolase 
UltraTM (Novozymes A/S). 

Amylases: Suitable amylases (alpha and/or beta) include those of bacterial or fungal 
origin. Chemically modified or protein engineered mutants are included. Amylases include, 
for example. D-amylases obtained from Bacillus, e.g. a special strain of B. lichenlfonmis, de- 
scribed In more detail in GB 1,296,839. 



33 



Examples of useful amylases are the variants described in WO 94/02597, WO 
94/18314. WO 96/23873. and WO 97/43424. especially the variants with substitutions in one 
or more of the following positions: 15, 23, 105, 106, 124, 128, 133. 154, 156, 181, 188, 190. 
197, 202, 208. 209, 243, 264. 304. 305, 391, 408, and 444. 

Commercially available amylases are Duramyl™, Termamyl'^'^, Fungamyl™ 
and BAN^ (Novozymes A/S). Rapidase™ and Purastar™ (from Genencor Intemational Inc.). 

Cellulases: Suitable cellulases include those of bacterial or fungal origin. Chemically 
modified or protein engineered mutants are included. Suitable cellulases Include cellulases 
from the genera Bacillus, Pseudomonas, Humicola, Fusarium. Thfelavia, Acremonium, e.g. 
the fungal cellulases produced from Humicola insolens, Myceliophthora thenmophlla and 
Fusarium oxysporum disclosed in US 4,435,307, US 5,648.263, US 5,691,178, US 
5,776,757 and WO 89/09259. 

Especially suitable cellulases are the alkaline or neutral cellulases having col- 
our care benefits. Examples of such cellulases are cellulases described in EP 0 495 257. EP 
0 531 372, WO 96/11262. WO 96/29397, WO 98/08940. Other examples are cellulase vari- 
ants such as those described in WO 94/07998. EP 0 531 315, US 5.457.046, US 5,686,593, 
US 5,763,254. WO 95/24471, WO 98/12307 and PCT/DK98/00299. 

Commercially available cellulases include Celluzyme®, and Carezyme® (No- 
vozymes), Clazinase®, and Puradax HA® (Genencor Intemational Inc.), and KAC-500(B) ® 
(Kao Corporation). 

Peroxidases/Oxidases: Suitable peroxidases/oxidases include those of plant, bacte- 
rial or fungal origin. Chemically modified or protein engineered mutants are Included. Exam- 
ples of useful peroxidases include peroxidases from Coprinus, e.g. from C. dnereus. and 
variants thereof as those described in WO 93/24618, WO 95/10602. and WO 98/15257. 

Commercially available peroxidases include Guardzyme® (Novozymes A/S). 

The detergent enzyme(s) may be included in a detergent composition by add- 
ing separate additives containing one or more enzymes, or by adding a combined additive 
comprising all of these enzymes. A detergent additive of the invention, i.e. a separate addi- 
tive or a combined additive, can be formulated e.g. as a granulate, a liquid, a slurry, etc. Pre- 
ferred detergent additive formulations are granulates, in particular non-dusting granulates, 
liquids, in particular stabilized liquids, or slurries. 

Non-dusting granulates may be produced, e.g., as disclosed in US 4,106,991 
and 4,661,452 and may optionally be coated by methods known in the art. Examples of waxy 
coating materials are poly(ethylene oxide) products (polyethyleneglycol, PEG) with mean 
molar weights of 1000 to 20000; ethoxylated nonylphenols having from 16 to 50 ethylene ox- 
ide units; ethoxylated fatty alcohols in which the alcohol contains from 12 to 20 carbon atoms 
and in which there are 15 to 80 ethylene oxide units; fatty alcohols; fatty acids; and mono- 
and di- and triglycerides of fatty acids. Examples of film-fomiing coating materials suitable for 
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application by fluid bed techniques are given in GB 1483591. Liquid enzyme preparations 
may, for instance, be stabilized by adding a polyol such as propylene glycol, a sugar or sugar 
alcohol, lactic acid or boric acid according to established methods. Protected enzymes may 
be prepared according to the method disclosed in EP 238,216. 

The detergent composition of the invention may be in any convenient form, 
e.g., a bar, a tablet, a powder, a granule, a paste or a liquid. A liquid detergent may be aque- 
ous, typically containing up to 70 % water and 0-30 % organic solvent, or non-aqueous. 

The detergent composition comprises one or more surfactants, which may be 
non-ionic including semi-polar and/or anionic and/or cationic and/or zwitterionic. The surfac- 
tants are typically present at a level of from 0.1% to 60% by weight 

When included therein the detergent will usually contain from about 1% to 
about 40% of an anionic surfactant such as linear alkylbenzenesulfonate, alpha- 
olefinsulfonate, alkyl sulfate (fatty alcohol sulfate), alcohol ethoxysulfate, secondary alkane- 
sulfonate, alpha-sulfo fatty acid methyl ester, alkyl- or alkenylsucclnic acid or soap. 

When included therein the detergent will usually contain from about 0.2% to 
about 40% of a non-ionic surfactant such as alcohol ethoxylate, nonylphenol ethoxylate, al- 
kylpolyglycoside, alkyldimethylamineoxide, ethoxylated fatty add monoethanolamide, fatty 
add monoethanolamide, polyhydroxy alkyl fatty acid amide, or N-acyl N-alkyI derivatives of 
glucosamine ("glucamides"). 

The detergent may contain 0-65 % of a detergent builder or complexing agent 
such as zeolite, diphosphate, triphosphate, phosphonate. carbonate, dtrate, nitrilotriacetic 
add, ethylenediaminetetraacetic acid, diethylenetriaminepentaacetic acid, alkyl- or alkenyl- 
succlnic acid, soluble silicates or layered silicates (e.g. SKS-6 from Hoechst). 

The detergent may comprise one or more polymers. Examples are carboxy- 
methylcellulose, poly(vinylpyrrolldone). poly (ethylene glycol), poly(vinyl alcohol), 
poly(vinylpyridine-N-oxide), poly(vinylimidazole), polycarboxylates such as polyacrylates, 
malerc/acrylic add copolymers and lauryl methacrylate/acrylic add copolymers. 

The detergent may contain a bleaching system which may comprise a H202 
source such as perborate or percarbonate which may be combined with a peracid-forming 
bleach activator such as tetraacetylethylenediamine or nonanoyloxyt>enzenesulfonate. Alter- 
natively, the bleaching system may comprise peroxyadds of e.g. the amide, imide, or sulfone 
type. 

The enzyme(s) of the detergent composition of the invention may be stabilized 
using conventional stabilizing agents, e.g., a polyol such as propylene glycol or glycerol, a 
sugar or sugar alcohol, lactic acid, boric acid, or a boric acid derivative, e.g., an aromatic bo- 
rate ester, or a phenyl boronic add derivative such as 4-formylphenyl boronic add. and the 
composition may be formulated as described in e.g. WO 92/19709 and WO 92/19708. 
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The detergent may also contain other conventional detergent ingredients such 
as e.g. fabric conditioners including clays, foam boosters, suds suppressors, anti-corrosion 
agents, soil-suspending agents, anti-soil redeposition agents, dyes, bactericides, optical 
brighteners, hydrotropes, tarnish inhibitors, or perfumes. 
5 It is at present contemplated that in the detergent compositions any enzyme, 

in particular the enzyme of the invention, may be added in an amount corresponding to 0.01- 
100 mg of enzyme protein per liter of wash liqour, preferably 0.05-5 mg of enzyme protein 
per liter of wash liqour, in particular 0.1-1 mg of enzyme protein per liter of wash liqour. 

The enzyme of the invention may additionally be incorporated in the detergent 
10 formulations disclosed in WO 97/07202 which is hereby incorporated as reference. 

EXAMPLES 

Example 1 : Isolation of DNA encoding 6H-61 polypeptides from Thelavia terrestris 
Mycelium of a T. terrestris is grown on MEX-1 to induce proteins that respond to complex 
cellulosic medias. After 4 days at 37 ''C. the mycelium is harvested by filtration though 

15 Whatman 1MM filter paper. Mycelium is frozen in liquid nitrogen and stored until further use. 
RNA Isolation is performed according to the protocol of Chomczynlski and Sacchi, 1987 
(Analytical Biochemistry 162: 156-159). The Poly(A) Quik mRNA isolation kit is used to purify 
polyA enriched RNA for cDNA production (Stratagene USA). Production of a cDNA library is 
achieved according to the SMART cDNA library construction kit (Clontech USA). Sfil re- 

20 stricted double strand cDNAs are cloned into Lambda TriplEx vector and the plasmid con- 
taining colonies is recovered by mass excision according to the SMART protocol. 

Individual plasmlds containing cDNAs are prepared using existing 96 well silica based 
plasmid preparation systems such as Qiagen Qia-turbo 96, Qiagen corp. USA). Once plas- 
mid template is prepared, all plasmids are sequenced with the vector fonA^ard primer using 

25 conventional sequencing methods and equipment 

Identification of GH61 Expressed Sequence Tags (ESTs) are achieved by searching 
the entire non redundant protein database (for example, SWALL) with a program such as 
BlastP (Altschul. S.F., Gish. W.. Miller. W., Myers. E.W. & Llpman, D.J. (1990) "Basic local 
alignment search tool." J. Mol. Biol. 215:403-410. Gish, W. & States. D.J. (1993) "Identifica- 

30 tion of protein coding regions by datat>ase similarity search." Nature Genet 3:266-272. 
Madden. T.L.. Tatusov, R.L. & Zhang, J. (1996) "Applications of networic BI-AST server 
Meth. Enzymol. 266:131-141. Altschul, S.F., Madden, T,L. Schaffer, A.A, Zhang, J., Zhang. 
Z.. Miller. W. & Lipman, D.J. (1997) "Gapped BLAST and PSI-BLAST: a new generation of 
protein database search programs." Nucleic Acids Res. 25:3389-3402. Zhang. J. & Madden. 

35 T.L. (1997) "PowerBLAST: A new network BLAST application for interactive or automated 
sequence analysts and annotation." Genome Res. 7:649-656. 
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Sequences with similarities to existing GH-61 family sequences are identified based 
on a blast probability score matrix. 

A done containing SEQ ID NO: 1 was selected for further analysis 

Example 2: Isolation of DNA encoding GH-61 polypeptides from Coprinus cinereus 
5 The procedure of example 1 was repeated for a strain of Humicola insotens, A clone contain- 
ing SEQ ID NO: 3 was selected for further analysis. 

Example 3: Isolation of DNA encoding 6H-61 polypeptides from Humicola insolens 
The procedure of example 1 was repeated for a strain of Coprinus cinereus for example Co- 
prinius cinerus (CBS394.65) obtainable from the CBS. A clone containing SEQ ID NO: 5 was 
1 0 selected for further analysis. 

Example 4: Construction of an Aspergillus expression vector for GH61 DNA se- 
quences 

SEQ ID NO: 3 was amplified in the following manner: 1 microliter of cDNA (approximately 10 
nanograms of DNA) was used as template in a PCR reaction with the two primers A and B. 

15 

Primer A: 5'-GCGGAATTCATCATGAGGCCCTTCTCCCTC-3' (SEQ ID NO:7) 

Primer B: 5'-ATTTGCGGCCGCTTCCCGTCATCCTCTAAGGC-3' (SEQID NO:8) 

SEQ ID NO: 1 and 5 can be amplified in a similar manner using the primers: 

20 

For SEQ ID NO: 1: 

Primer A: 5'-GCGGAATTCATGAAGCTCACCACCTCGGT-3' (SEQ ID NO:9) 

Primer B: 5'-ATTTGCGGCCGCGCAGCCAACCAACCTGGAAT-3* (SEQ ID NO: 1 0) 

26 For SEQ ID NO: 5: 

Primer A: 6'-GCGGAATTCACAATGAAGGTCTTCGCATAC-3' (SEQ ID NO:1 1 ) 

primer B: 5'-ATTTGCGGCCGCACGATGCGATGAGCA7TTAT-3' (SEQ ID NO:12) 

5 pmoles of each primer was used in a 50 microliter reaction volume. The Qiagen 
30 ProofStart high fidelity DNA polymerase and buffer were used according to the manufactur- 
ers instructions (Qiagen, USA). Briefly, the reaction was placed in a thermal cycler (MJ Re- 
search, Dyad, USA) and cycled under the following reaction conditions: An initial denatura- 
tion of 5 minutes at 95 degrees Celsius, 25 cycles of the following: 94 degrees-30 seconds, 
55 degrees-30 seconds, 72 degrees 2 minutes. A final extension temperature of 72 degrees 
35 for 10 minutes was then used. Aliquots of the PCR reaction were separated on a 1% aga- 
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rose gel. One distinct band was seen: The size of this band (1.1 kb) corresponded well with 
the predicted size of the open reading frame. 

The fragment was digested with EcoRI and NotI which cut In the overhangs intro- 
duced by the PCR primers. The digested fragments were isolated and cloned into pMStr54, 
5 an Aspergillus expression plasmid based on the plasmid pCaHj527 (see the examples of in- 
ternational patent application WO 00/70064) constructed as described in example 7 of WO 
02/12472. Plasmid DNA was isolated from colonies of the cloning experiment. The colonies 
were sequenced with vector primers PNA2I (5'-GTT TCC AAC TCA ATT TAC CTC-3') and 
MHasSNotI (5-TTG CCC TCA TCC CCA TCC TrT-3') which prime in opposite directions into 
10 the plasmid insert. It was detenmined that no errors were introduced In any of the insert se- 
quences as a result of PCR. 

Example 5: Expression of SEQ ID NOS: 1, 3 and 5 In Aspergillus: 

SEQ ID NOS: 1, 3 and 5 were transfomned Into Aspergillus oryzae strain JAL355 (dis- 
closed in international patent application WO 01/98484A1). Transformants of SEQ ID NOS: 

15 1,3 and 5 were re-isolated twice under selective and noninducing conditions on Cove 
minimal plates with 1M sucrose as a carbon source and lOmM nitrate. (See recipe of the 
manufacturer) To test expression of SEQ ID NOS: 1.3 and S. transformants were grown for 3 
days and 4 days at 30 degrees Celsius in tubes with 10 ml YPM (2% peptone, 1% yeast ex- 
tract. 2% maltose). Supematants were run on NuPage 10% Bis-Tris SDS gels (Invitrogen) as 

20 recommended by the manufacturer. All AspergHlus isolates grew well even when induced for 
the expression of SEQ ID NOS: 1, 3 and 5. 

Example 6: Purification of SEQ ID NO: 2 from expression of SEQ ID NOS: 1 in Asper- 
gillus: 

A culture supernatant from a fermentation of the Aspergillus oryzae strain expressing SEQ ID 
25 NO: 1 was filtered through a 0,22 Mm filter to remove the mycelia. 350 ml of the filtered su- 
pernatant was diluted to 1450 ml with water and pH adjusted to 7.5 resulting in a conductivity 
of 1.8 mS/cm. This solution was loaded onto a 50 ml Q-Sepharose anion exchange column 
equilibrated with 25 mM Tris pH 7.5. The column was washed with about 15 column volumes 
of 25 mM Tris plH 7.5 and bound proteins were eluted with a NaCI gradient increasing linearly 
30 from 0 to 0.5 M over 20 column volumes. From SDS-PAGE it was seen that a polypeptide 
with a molecular weight of about 29 kDa was eluted during the wash with 25 mM Tris, pH 
7.5. Fractions containing the 29 kDa polypeptide were pooled and concentrated on an Ami- 
con ultrafiltration device with a 6 KDa cut off filter (Dow. GR 81 PP). The concentrated pool 
was at least 95% pure as estimated from SDS-PAGE and N-terminal sequencing of the poly- 
35 peptide gave the sequence con-esponding to SEQ ID NO: 2 (HYTFPQTDINGQLSGE). 
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Example 7: Purification of SEQ ID NO: 4 from expression of SEQ ID NOS: 3 in Asper- 
gillus: 

A culture supernatant from a fermentation of the Aspergillus oryzae strain expressing SEQ ID 
NO: 3 was filtered tlirough a 0.22 \im filter to remove the mycelia. 500 ml of the filtered su- 
5 pematant was ultrafiltrated in a Fiitron device with a 10 t^Da cut off membrane. pH was ad- 
justed to 9.5 and the filtrate loaded onto a 50 ml Q-Sepharose anion exchange column 
equilibrated with 25 mM glycine pH 9.5. The column was washed with about two column vol- 
umes of 25 mM glycine pH 9.5 and bound proteins were eluted with a NaCi gradient increas- 
ing linearly from 0 to 0.5 M over 20 column volumes. From SDS-PAGE It was seen that a 
10 polypeptide with a molecular weight of about 32 kDa was eluted during the wash with 25 mM 
glycine pH 9.5. Fractions containing the 32 kDa polypeptide were pooled and concentrated 
on an Amicon ultrafiltration device with a 20 kDa cut off filter (DDS, GR 61 PR). Purity of the 
concentrated pool was around 85% as estimated from SDS-PAGE and concentration around 
2.4 mg/ml estimated from absorbance at 280 nm and theoretical extinction coefficient. 

1 5 Example 4: Testing activity of SEQ ID NOS: 2, 4 and 6 

An oat AZCL-xylan, a birchwood AZCL-xylan and a wheat AZCL arabinoxylan sub- 
strate suspension were prepared by suspending 2 milligram per mllillitre of the substrate in a 
Nap buffer, pH 7 containing 0.0225 % w/w Brij. 

Activity of the purified SEQ ID NOS 2, 4 and 6 were tested by mixing 500 microliter 

20 substrate suspension with 100 microliter of a solution containing isolated SEQ ID NOS 2, 4 
or 6. This mixture was incubated at 37 **C. wherein after undigested substrate was sedimen- 
tated by centrifugation and digested substrate was evaluated by measuring absorbance of 
the supernatant at 590 nanometers. A blank value detenmlned by replacing for one sample 
the enzyme solution with a buffer was subtracted the absorbance measurements. 

25 



Results: 



GH-61 polypeptide 


Absorbance for 
Oat AZCL-xylan 


Absorbance for 
Birchwood AZCL- 
xylan 


Absorbance for 
Wheat AZCL- 
arabinoxylan 


SEQ ID NO: 2 


0,0172 


0,1077 


0.1296 


SEQ ID NO: 4 


0.0798 


0,4017 


0.8128 


SEQ ID NO: 6 


0.0536 


0.2064 


0.6101 



These results showed for the first time GH-61 polypeptides having xylanase activity. 
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CLAIMS 

1. An isolated GH-61 polypeptide having xylanase activity. 

5 2. The xylanase of claim 1, comprising in it naturally occurring mature form conserved 
portions of a H at position 1 . A or P at position 59, G at position 60, G at position 75, P or A 
at position 76. W or F at position 100, F or T at position 101, K or C at position 102, I or V or 
L at position 103, L or I or V or M at position 130, P at position 131, G and Xaa and Y at posi- 
tion 137-139, L or V or I or M at position 140, L or V or I or M at position 141 , R at position 
10 142, E or Q at positions 143-144, L or V or I at position 148, H or N at position 149, C at posi- 
tion 163 and P and G and P at position 209-21 1. 

3. The xylanase of claim 2 comprising the amino acid sequence of the mature xylanases 
of SEQ ID NO:2. SEQ ID NO:4 or SEQ ID NO:6, an allelic variant thereof; or a fragment 

1 5 thereof that has the xylanase activity. 

4. The xylanase of daim 3 comprising an amino acid sequence which has at least 65% 
identity to the amino acids of the mature xylanases of SEQ ID NO:2, SEQ ID NO:4 or SEQ 
ID NO:6. 

20 

5. The xylanase of claim 4 differing at the most by ten amino acids from amino acids of 
the mature xylanases of SEQ ID N0:2, SEQ ID N0:4 or SEQ ID NO:6 

6. The xylanase of claim 5 consisting of the amino acid of the mature xylanases of SEQ 
25 ID N0:2, SEQ ID NO:4 or SEQ ID NO:6. 

7. The xylanase of daim 2 comprising at least one substitution, deletion or insertion out- 
side the conserved portions of the amino add sequence. 

30 8. The xylanase of claim 2 comprising at least one substitution inside the conserved por- 
tions. 

9. The xylanase of daim 7, wherein the number of substitutions, deletions or insertions 
is at the most 10. 

35 
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10. The xylanase of daim 1 wherein the xylanase is encoded by nucleotide sequences 
which hybridize under very low stringency conditions with a polynucleotide probe selected 
from the group consisting of 

(i) the complementary strand of nucleotides 52 to 699 of SEQ ID NO:1. 46 to 957 of 
SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5. 

(ii) the complementary strand of the cDNA sequence contained in nucleotides 52 to 699 
of SEQ ID NO:1 . 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 

(iii) the complementary strand of nucleotides 46 to 857 of SEQ ID NO:3, 

(iv) the complementary strand of nucleotides 52 to 300 of SEQ ID NO:1. 46 to 501 of 
SEQ ID NO:3 or 58 to 300 of SEQ ID NO:5 or 

(v) the complementary strand of nucleotides 301 to 699 of SEQ ID NO:1, 502 to 957 of 
SEQ ID N0:3 or 301 to 660 of SEQ ID NO:5. 

1 1 . The xylanase of claim 10, wherein the xylanase is encoded by a polynucleotide com- 
prising the nucleotide sequence of nucleotides 52 to 699 of SEQ ID NO:1, 46 to 957 of SEQ 
ID NO:3 or 58 to 660 of SEQ ID NO:5 or sequences differing from 52 to 699 of SEQ ID NO:1 . 
46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 by virtue of the degeneracy of the 
genetic code 

12. The xylanase of claim 1 1 , wherein the xylanase is encoded by a polynucleotide 
consisting of nucleotide sequence of nucleotides 52 to 699 of SEQ ID NO:1. 46 to 957 of 
SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 or sequences differing from 52 to 699 of SEQ ID 
NO:1 , 46 to 957 of SEQ ID NO:3 or 58 to 660 of SEQ ID NO:5 by virtue of the degeneracy of 
the genetic code. 

13. The xylanase of daim 1 , obtained from a fungus. 

14. A polynudeotide comprising a nudeotide sequence encoding a xylanase of daims 1 
to 13. 

15. The polynudeotide of claim 14, wherein the nucleotide sequence comprise a subse- 
quence which encode: 

(a) a fragment of SEQ ID NO:2 that exhibit xylanase activity and contains the conserved 
portions, 

(b) a fragment of SEQ ID NO:4 that exhibit xylanase activity and contains the conserved 
portions or 

(c) a fragment of SEQ ID NO:6 that exhibit xylanase activity and contains the conserved 
portions. 
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16. The polynucleotide of claim 15, wherein the nucleotide sequence comprise a subse- 
quence of: 

(a) SEQ ID NO:1 which encode fragments of SEQ ID NO:2 that exhibit xylanase activity 
and contains the conserved portions. 

(b) SEQ ID NO:3 which encode fragments of SEQ ID NO:4 that exhibit xylanase activity 
and contains the conserved portions or 

(c) SEQ ID NO:5 which encode fragments of SEQ ID NO:6 that exhibit xylanase activity 
and contains the conserved portions. 

17. The polynucleotide of claim 16, comprising a nucleotide sequence encoding the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6 or the mature xy- 
lanase thereof. 

15 18. The polynucleotide of dalm 17, comprising the nucleotide sequence of the mature 
xylanase coding region of SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:5 

1 9. The polynucleotide of claim 18, consisting of the nucleotide sequence of the mature 
xylanase coding region of SEQ ID NO:1 , SEQ ID N0:3 or SEQ ID NO:5 

20 

20. A nucleic acid construct comprising a nucleotide sequence of claims 14-19 linked to 
one or more control sequences that direct the production of the polypeptide in a suitable 
host. 

25 21 . A recombinant expression vector comprising the nudeic add construct defined in 
dalm 20. 

22. A recombinant host cell comprising the nudeic add construct of dalm 20. 
30 23. A method for produdng the xylanase of claims 1 to 13 comprising: 

(a) cultivating a strain, which in its wild-type fonri is capable of produdng the xylanase, to 
produce the xylanase; and 

(b) recovering the xylanase. 



35 



24. A method for producing the xylanase of claims 1 to 13, the method comprising: 
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(a) cultivating a recombinant host cell as defined in claim 22 under conditions conducive 
for production of the xylanase; and 

(b) recovering the xylanase. 

5 

25. A transgenic plant comprising a nucleotide sequence of the invention and capable of 
expressing the xylanase of claims 1 to 13. 

26. A composition comprising the xylanase of claims 1 to 13. 

10 

27. The composition of claim 26, further comprising a surfactant 
2S. The composition of claim 26 further comprising a cereal product 

1 5 29. Use of a xylanase of claims 1 to 1 3 for hydrolyzing a xylolosic substrate. 

30. Use of a xylanase of claims 1 to 13 for preparing an animal feed composition 

31 . Use of a xylanase of claims 1 to 1 3 for preparing a dough composition. 

20 

32. Use of a xylanase of claims 1 to 1 3 for preparing alcohol fuel. 

33. Use of a xylanase of claims 1 to 1 3 for treatment of brewing wort. 

25 34. Use of a xylanase of claims 1 to 1 3 for preparation of fruit or vegetable juice. 

35. Use of a xylanase of claims 1 to 1 3 for cleaning a fabric 

36. Use of a xylanase of daims 1 to 1 3 for treatment of pulp. 

30 
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ABSTRACT 

The present invention relates to an isolated GH-61 polypeptide having xylanase activity. 
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SEQUENCE LISTING 



<110> 

<120> 

<130> 

<160> 

<170> 

<210> 
<211> 
<212> 
<213> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



Novozymes A/S 
GH-61 xylanases 
10274/Novozymes 
12 

Patentin version 
1 

699 
DMA 

T. terrestris 



si^_peptide 



3-1 



(51) 



mat^peptide 
C52)..0 



CDS 

CI). 



(699) 



<400> 1 

atg aag etc acc acc teg 
Met Lys Leu Thr Thr Ser 
-15 

gcc cac tac acc ttc ccg 
Ala His Tyr Thr Phe Pro 
-11 5 

gaa tgg gtg acg ate egt 
Glu Trp vai Thr lie Arg 
20 

gtc acc gac gtc acg teg 
val Thr Asp val Thr ser 
35 



gtc gee etg 
val Ala Leu 
-10 

cag acg gac 
Gin Thr Asp 



gag acg acg 
Glu Thr Thr 



gac cag ate 
Asp Gin lie 
40 



ett gcc 
Leu Ala 



ate aac 
lie Asn 
10 

aac cac 
Asn His 
25 

egc tge 
Arg Cys 



gcg gcg ggc 
Ala Ala Gly 

ggc cag etc 
Gly Gin Leu 



gcg caa 
Ala Gin 



tec ggc 
ser Gly 



tac teg cac 
Tyr ser His 



tac gag etc 
Tyr Glu Leu 
45 



15 



ggg ccg 
Gly Pro 
30 

aac ccg 
Asn Pro 



ggc ace cec gcg ecc cag ate gee aeg 
Gly Thr Pro Ala Pro Gin lie Ala Thr 
50 55 

acc ttc ace gtc gac cea age ate cag 
Thr Phe Thr val Asp pro ser He Gin 
65 70 



tac atg gee aag gcg ccg 
Tyr Met Ala Lys Ala Pro Ser Gly 
80 85 



teg ggc cag 
niy Gin 



acc ggc aac gtc tgg ttc 
Thr Gly Asn Val Trp Phe 
100 

ggc acc age aac ate acc 
Gly Thr ser Asn lie Thr 
115 



aag ate tac 
Lys lie Tyr 



tgg cet age 
Trp Pro Ser 
120 



gtg cag 
val Gin 



cac ccg 
His Pro 



acc gcc 
Thr Ala 
90 

gag gac 
Glu Asp 
105 

age ggc 
Ser Gly 
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gee gac ggc 
Ala Gly Gly 
60 

ggc ccg etg 
Gly Pro Leu 
75 

gcc acc ttc 
Ala Thr Phe 



ggc ccg tec 
Gly Pro Ser 



aaa acc gaa 
Lys Thr Glu 
125 



acc gtg 
Thr val 



cag ttc 
Gin Phe 



cag ggc 
Gin Gly 
95 

ggc etc 
Gly Leu 
110 

gtc age 
val ser 
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gtc aag ate ccc teg tgc ate geg ccg ggc gae tac etc etg cge gtg 
val Lys lie Pro ser cys lie Ala Pro Gly Asp Tyr Leu Leu Arg vai 
130 135 140 



tac etc geg tgc gee cag etg acc gtc ace ggc ggc ace ggc acc etc 
Tyr Leu Ala Cys Ala Gin Leu Thr val Thr Gly Gly Thr Gly Thr Leu 
160 165 170 175 



480 



gag cae ate geg etg eac age gee age acc gtc ggc ggc gee cag tte 528 
61 u His lie Ala Leu His Ser Ala ser Thr Val Gly Gly Ala Gin Phe 
145 150 155 



576 



aac aeg ggc gag etc gtc gee tte ccc ggc gee tac age gee ace gae 624 
Asn Thr Gly Glu Leu val Ala Phe Pro Gly Ala Tyr ser Ala Thr Asp 
180 185 190 



ccg ggc ate etc tte cag etg tac tgg eec ate ccg acc age tac acc 672 
Pro Gly lie Leu Phe Gin Leu Tyr Trp Pro He Pro Thr Ser Tyr Thr 
195 200 205 



aac ccc ggc ccg geg ccc gtt age tgc 699 
Asn Pro Gly Pro Ala Pro val ser cys 
210 215 

<210> 2 
<211> 233 
<212> PRT 

<213> T. terrestris 
<400> 2 

Met Lys Leu Thr Thr ser Val Ala Leu Leu Ala Ala Ala Gly Ala Gin 
-15 -10 -5 

Ala His Tyr Thr Phe Pro Gin Thr Asp lie Asn Gly Gin Leu Ser Gly 
-11 5 10 15 

Glu Trp Val Thr lie Arg Glu Thr Thr Asn His Tyr Ser His Gly Pro 
20 25 30 

val Thr Asp val Thr Ser Asp Gin lie Arg cys Tyr Glu Leu Asn Pro 
35 40 45 

Gly Thr Pro Ala Pro Gin lie Ala Thr val Gin Ala Gly Gly Thr val 
50 55 60 

Thr Phe Thr val Asp Pro ser lie Gin His Pro Gly Pro Leu Gin Phe 
65 70 75 

Tyr Met Ala Lys Ala Pro ser Gly Gin Thr Ala Ala Thr Phe Gin Gly 
80 85 90 95 

Thr Gly Asn val Trp Phe Lys lie Tyr Glu Asp Gly Pro ser Gly Leu 
100 105 110 

Gly Thr ser Asn lie Thr Trp Pro Ser ser Gly Lys Thr Glu val Ser 
115 120 125 
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val Lys lie Pro ser cys lie Ala Pro Gly Asp Tyr Leu Leu Arg val 
IBO 135 140 

Glu His He Ala Leu His ser Ala ser Thr Val Gly Gly Ala Gin Phe 
145 150 155 

Tyr Leu Ala Cys Ala Gin Leu Thr val Thr Gly Gly Thr Gly Thr Leu 
160 165 170 175 

Asn Thr Gly Glu Leu val Ala Phe Pro Gly Ala Tyr ser Ala Thr Asp 
180 185 190 

Pro Gly lie Leu Phe Gin Leu Tyr Trg Pro lie Pro Thr ser Tyr Thr 



195 200 205 

Asn Pro Gly Pro Ala Pro val ser Cys 
210 215 



<210> 3 

<211> 957 

<212> DNA 

<213> Humicola insolens 
<220> 

<221> CDS 

<222> (1) . . C957) 

<223> 

<220> 

<221> sig^eptide 

<222> (l5..(45) 
<223> 

<220> 

<221> mat^peptide 

<222> (46) . . O 
<223> 

<220> 

<221> mi sc_f eature 

<222> (858).. (957) 

<223> CBMl domain 

<400> 3 

atg agg ccc ttc tec etc gtc gcg ctg gcg acg gcc gtc age gge cac 48 
Met Arg Pro Phe ser Leu val Ala Leu Ala Thr Ala val Ser Giy His 
-15 -10 -5 -1 1 



gcc ate ttc cag cgc gtg teg gtt aac ggt gtc gac caa gge cag etc 
Ala He Phe Gin Arg val Ser val Asn Gly val Asp Gin Gly Gin Leu 

5 10 15 



96 



aag gge gtq egg get ccc teg age aac tac ccc att gag aac gtc aac 144 

Lys Gly val Arg Ala Pro Ser Ser Asn Tyr pro He Glu Asn val Asn 
20 25 30 

cac ccc gac ttt gcc tge aac ace aac ate egg cac cgc gac gge ace 192 

His Pro Asp Phe Ala cys Asn Thr Asn He Arg His Arg Asp Gly Thr 
35 40 45 
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gtc ate aag ate ccc gcc gqc gcc acc gtc ggc gcc tgg tgg cag cac 240 

val lie Lys He pro Ala Gly Ala Thr Val Gly Ala Trp Trp Gin His 

50 55 60 65 

gag ate ggc ggg ccc teg ttc ccc ggt gat ccg gat aac ccg ate get 288 
61 u lie Gly Gly Pro Ser Phe Pro Gly Asp Pro Asp Asn Pro lie Ala 
70 75 80 

gcc teg cac aag ggc ccc ate caa gtc tat etc gcc aag gtc gae aac 336 
Ala ser His Lys Gly Pro lie Gin val Tyr Leu Ala Lys val Asp Asn 
85 90 95 

gcc get acc gcc tet ccc aac ggc ctg egg tgg ttc aag att gcc gag 384 
Ala Ala Thr Ala Ser Pro Asn Gly Leu Arg Trp Phe Lys ile Ala Glu 
100 105 110 

aag ggc ctg teg ggc ggc gtc tgg gcc gtc gae gag atg ate cgc aac 432 
Lys Gly Leu ser Gly Gly val Trp Ala Val Asp Glu Met lie Arg Asn 
115 120 125 

aac ggc tgg cac tac ttc acc atg ccg cag tgc ate geg ccc ggc cac 480 
Asn Gly Trp His Tyr Phe Thr Met Pro Gin Cys Ile Ala Pro Gly His 
130 135 140 145 

tac ctg atg cgc gtc gag ctg ctg gcg ctg cac teg gee age ttc ccc 528 
Tyr Leu Met Arg val Glu Leu Leu Ala Leu His Ser Ala ser Phe Pro 
150 155 160 

ggc ggc gcc cag ttc tac atg gag tgc gcc cag ate gag gtc acc ggc 576 
Gly Gly Ala Gin Phe Tyr Met Glu Cys Ala Gin lie Glu Val Thr Gly 
165 170 175 

teg ggc aac ttc teg ccc tec gag acg gtc age ttc ccc ggc gcc tac 624 
ser Gly Asn Phe ser Pro Ser Glu Thr val Ser Phe Pro Gly Ala Tyr 
180 185 190 



ccg gcc aac cac ccg ggt ate gtc gtc age ate tac gae gee cag ggt 
Pro Ala Asn His Pro Gly He val val Ser Ile Tyr Asp Ala Gin Gly 
195 200 205 



ate acc tgc tec ggc ggt gga age aac aat ggt ggc ggg aac aac aat 
lie Thr Cys Ser Gly Gly Gly Ser Asn Asn Giy Gly Gly Asn Asn Asn 
230 235 240 

ggt ggt gga aac aat aac ggc ggc ggc ggc aac aac aac ggc ggt ggg 
Gly Gly Gly Asn Asn Asn Gly Gly Gly Gly Asn Asn Asn Giy Gly Gly 
245 250 255 



ggc cag tgc ggc ggc aat ggg tat tec ggc ccg acg act tgc gcc gag 
Gly Gin Cys Gly Gly Asn Gly Tyr Ser Gly Pro Thr Thr Cys Ala Glu 
275 280 285 



672 



aac gee aac aac ggc ggg cgc gag tac cag ate ccc ggg ccg egg ccg 720 
Asn Ala Asn Asn Gly Gly Arg Glu Tyr Gin lie Pro Gly pro Arg pro 
210 215 220 225 



768 



816 



aac aac aac ggt ggc ggt aac acc ggt ggc ggc teg gcg ccg etc tgg 864 
Asn Asn Asn Gly Gly Gly Asn Thr Giy Gly Gly Ser Ala Pro Leu Trp 
260 265 270 



912 



ggt act tgc aag aag cag aat gae tgg tac teg cag tgt acg cet 957 
Gly Thr cys Lys Lys Gin Asn Asp Trp Tyr ser Gin Cys Thr Pro 
290 295 300 

<210> 4 
<211> 319 
<212> PRT 
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<213> Humlcola 1nso1 ens 
<400> 4 

Met Arg Pro Phe ser Leu val Ala Leu Ala Thr Ala val ser Gly His 
-15 -10 -5 -1 1 

Ala lie Phe Gin Arg val Ser val Asn Gly val Asp Gin Gly Gin Leu 
5 10 15 

Lys Gly Val Arg Ala Pro ser ser Asn Tyr Pro lie Glu Asn val Asn 
20 25 30 

His Pro Asp Phe Ala Cys Asn Thr Asn lie Arg His Arg Asp Gly Thr 
35 40 45 

Val lie Lys lie Pro Ala Gly Ala Thr Val Gly Ala Trp Trp Gin His 
50 55 60 65 

Glu lie Gly Gly Pro Ser Phe Pro Gly Asp Pro Asp Asn Pro lie Ala 
70 75 80 

Ala Ser His Lys Gly Pro He Gin val Tyr Leu Ala Lys val Asp Asn 
85 90 95 

Ala Ala Thr Ala Ser Pro Asn Gly Leu Arg Trp Phe Lys lie Ala Glu 
100 105 110 

Lys Gly Leu ser Gly Gly val Trp Ala Val Asp Glu Met He Arg Asn 
115 120 125 

Asn Gly Trp His Tyr Phe Thr Met Pro Gin Cys lie Ala Pro Gly His 
130 135 140 145 

Tyr Leu Met Arg val Glu Leu Leu Ala Leu His Ser Ala ser Phe Pro 
150 155 160 

Gly Gly Ala Gin Phe Tyr Met Glu cys Ala Gin lie Glu Val Thr Gly 
165 170 175 

ser Gly Asn Phe Ser Pro Ser Glu Thr val Ser Phe Pro Gly Ala Tyr 
180 185 190 

Pro Ala Asn His Pro Gly lie val Val Ser He Tyr Asp Ala Gin Gly 
195 200 205 

Asn Ala Asn Asn Gly Gly Arg Glu Tyr Gin lie Pro Gly Pro Arg pro 
210 215 220 225 

lie Thr Cys Ser Gly Gly Gly ser Asn Asn Gly Gly Gly Asn Asn Asn 
230 235 240 
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Gly Gly Gly Asn Asn Asn Gly Gly Gly Gly Asn Asn Asn Gly Gly Gly 
245 250 255 
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CCC gac ggc cag age cag etc tec ace ace ate ccc tec aae act ccc 432 

Pro Asp Gly Gin ser Gin Leu Ser Thr Thr lie Pro Ser Asn Thr Pro 

110 115 120 125 

age ggc gag tac etc etc cgc ate gag cac ate ggt etc cae gtc gee 480 

ser Gly Glu Tyr Leu Leu Arg He Glu His lie Gly Leu His Val Ala 

130 135 140 



ggc get ccc caa tac tac ate tec tgc gee caa ate cgc gtt aat ggc 
Gly Ala Pro Gin Tyr Tyr He ser Cys Ala Gin lie Arg val Asn Gly 
145 150 155 



<210> 6 
<211> 220 
<212> PRT 

<213> coprinus cinereus 
<400> 6 

Met Lys Val Phe Ala Tyr val Ala Leu Leu Ala Ala Ala Ala Gin ser 
-15 -10 -5 

Ala Ser Ala His Tyr He Trp Thr Thr Leu Thr Ala Gly Gly Gin Thr 
-11 5 10 

Thr ser Ala val lie Arg Gin Pro Leu Asn Asn Ser Pro val Glu Asp 
15 20 25 

Val Ser ser Pro His Met Arg cys Asn val Asn Pro Met Pro Ala ser 
30 35 40 45 

Gin Thr Leu Asn val Gin Ala Gly Ser Ser val Thr Phe Arg Leu Asp 
50 55 60 



Asn Thr Leu 



Tyr His Pro Gly Pro Ala Ala lie Tyr Leu Gly Gin val 
65 70 75 



Pro Ala Gly Gin Thr Ala Ala ser Trp Asp Gly ser Gly Ala Asn Trp 
80 85 90 

Phe Lys He Asp Glu Phe Gly Ala Gin Phe Asn Pro Phe Arg Phe He 
95 100 105 

Pro Asp Gly Gin Ser Gin Leu ser Thr Thr He Pro ser Asn Thr Pro 
110 115 120 125 
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ggt gqa ggc ggc aae cea ccc aag gtc tec ate ccc gga tac gtc teg 576 
Gly Gly Gly Gly Asn Pro Pro Lys Val ser lie Pro Gly Tyr val ser 
160 165 170 

agg aae gac ccc ggt etc ace gtc aae ate cae tgg ccc ate ccc acc 624 
Arg Asn Asp Pro Gly Leu Thr Val Asn lie His Trp Pro lie Pro Thr 
175 180 185 

tee tac acc gtc ccc gga ccc cgc ccc tgg agg gga 660 
Ser Tyr Thr Val Pro Gly Pro Arg Pro Trp Arg Gly 
190 195 200 
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Ser Gly Glu Tyr Leu Leu Arg lie Glu His lie Gly Leu His val Ala 
130 135 140 

Gly Ala Pro Gin Tyr Tyr lie Ser cys Ala Gin lie Arg Val Asn Gly 
145 150 155 

Gly Gly Gly Gly Asn Pro Pro Lys val ser lie Pro Gly Tyr val Ser 
160 165 170 . 

Arg Asn Asp Pro Gly Leu Thr val Asn He His Trp pro He Pro Thr 
175 180 185 



Ser Tyr Thr val Pro Gly Pro Arg Pro Trp Arg Gly 
190 195 200 

<210> 7 
<211> 30 
<212> DNA 

<213> Primer A for SEQ id no: 3 
<400> 7 

gcggaattca tcatgaggcc cttctccctc 30 

<210> 8 
<211> 32 
<212> DNA 

<213> Primer B for SEQ ID NO: 3 

<40a> 8 

atttgcggcc gcttcccgtc atcctctaag gc 32 

<210> 9 
<211> 29 
<212> DNA 

<213> Primer A for SEQ id no: 1 
<400> 9 

gcggaattca tgaagctcac cacctcggt 29 

<210> 10 
<211> 32 
<212> DNA 

<213> Primer B for SEQ ID NO: 1 
<400> 10 

atttgcggcc gcgcagccaa ccaacctgga at 32 

<210> 11 
<211> 30 
<212> DNA 

<213> Primer A for SEQ ID NO: 5 
<400> 11 

gcggaattca caatgaaggt cttcgcatac 30 



<210> 12 
<211> 32 



Page 8 



10274.ST25.txt 



<2i3> Primer B for SEQ ID NO: 5 

<400> 12 ,4. 
atttgcggcc gcacgatgcg atgagcattt at 
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